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Logical  Dependence  of  Chapters 

This  study  extends  over  several  areas.  For  the  benefit  of 
readers  interested  in  one  particular  subject,  certain 

] 

Chapters  can  be  read  independently  of  the  others.  The 

logical  dependence  of  the  Chapters  is  shown  below.  j 

j 

1 l 

2 >5 
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= A must  be  read  before  B 
= Results  from  A are  used  in  B, 
but  it  is  not  essential  to  read 
A before  B. 
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NOTATION 


We  state  some  conventions  here,  which  will  eliminate  the 
need  for  minor  oefinitions  throughout  this  work.  Underlined 
lower-case  letters  represent  column  vectors . Upper-case 
letters  may  represent  matrices , sets  , or  constants . 
Subscripts  on  a symbol  usually  denote  a component  of  the 
corresponding  vector  or  matrix  that  the  symbol  represents, 
for  example 

Xj  is  the  j component  of  the  vector  £ 

is  the  (i,j)  component  of  the  matrix  A. 


Superscripts  will  be  used  to  differentiate  between  symbols 

7 2 k 

of  the  same  type,  for  example  x , x , x . 


Vector  inequalities  are  to  be  interpreted  componentwi se , 
that  is 

a < b means  a^  < b^  for  all  i. 

The  zero  vector  will  often  be  denoted  simply  by  0,  and  its 
dimension  will  be  apparent  from  the  context. 


The  notation  x'  denotes  the  transpose  of  the  vector  x. 
En  denotes  the  n-d imensional  Euclidean  vector  space. 
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Synopsis 

This  work  studies  the  application  of  decentralization 
to  the  problem  of  Resource  Management  in  Large  Systems.  In 
an  operational  system,  where  a very  large  number  of 
activities  share  limited  resources,  this  Resource  Management 
problem  has  three  objectives.  The  first  ("Initial 
Allocation")  is  to  find  an  assignment  of  resources  to  every 
activity,  such  that  all  the  system  constraints  are 
satisfied,  and  all  activities  are  operating.  The  second 
( "Ne w-Ass ignment" ) is  to  find  a rationale  for  allocating 
resources  to  new  activities.  Mew  activities  are  initiated 
frequently  enough  that  we  do  not  wish  to  re-solve  the  entire 
problem  for  the  combined  set  of  old  and  new  activities.  The 
third  objective  ("Periodic  Review")  is  to  find  an  efficient 
way  of  re-allocating  resources  in  order  to  reflect  the 
changing  needs  of  the  individual  activities,  as  well  as  the 
changes  in  total  resource  usages. 

Conventionally,  the  resource-allocation  problem  has 
been  studied  for  the  case  where,  in  addition  to  the 
constraints,  there  exists  an  objective  to  be  maximized.  Our 
emphasis,  as  is  reflected  by  the  title  of  this  work,  is  on 
the  feasibility  aspect  of  the  problem,  that  is,  of  taking  a 
large  system  and  keeping  it  operational.  We  show  that  this 
in  itself  is  both  an  important  problem,  and  has 
theoretically  interesting  consequences.  In  addition,  our 
results  can  be  useful  for  tne  solution  of  the  general 
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(optimization)  problem. 

Chapter  2 provides  the  motivation  for  our  study.  The 
Chapter  is  divided  into  two  Parts.  Part  I describes  the 
Resource  Management  problem  in  a large  FIAT  spare-parts 
warehouse.  We  discuss  why  the  warehouse  management  is 
satisfied  with  a feasible  solution,  and  do  not  require  any 
notion  of  optimality.  The  size  of  the  problem  (there  are 
60,000  par t-number s , and  30-50  new  part-numbers  are  to  be 
allocated  every  day)  , and  the  fact  that  the  resource  usage 
functions  are  discontinuous  and/or  nonlinear  and/or 
nonconvex,  make  the  problem  a hard  one.  We  discuss  why 
standard  techniques  are  not  suitable.  Part  II  of  Chapter  2 
advocates  a decentralized  approach  to  the  problem.  We  show 
now,  by  replacing  the  feasibility  problem  by  a suitable 
"Artificial”  optimization  problem,  we  can  use  Lagrange 
Multipliers  to  simplify  the  solution  through 
decentralization  of  decisions.  We  then  show  how  this 
decentralized  approach  offers  substantial  advantages  over 
standard  techniques,  with  regard  to  each  of  the  three 
objectives  above. 

Although  the  use  of  Lagrange  Multipliers  for 
decomposition  of  large  problems  is  well  known,  the  main 
drawback  of  this  technique  is  the  existence  of  "duality 
gaps".  In  Chapter  3 we  present  a Theorem  giving  simple 
conditions  which  guarantee  the  ex istence  of  optimal 
multipliers.  Such  results  have  previously  been  given  only 
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under  strict  conditions.  We  are  able  to  completely  remove 
restrictions  on  the  form  of  the  individual  resource  usage 
functions,  replacing  them  instead  by  a condition  based  on 
their  magnitudes  alone.  This  result  gives  a firm 
theoretical  basis  to  our  approach,  since  it  justifies  the 
use  of  our  technique. 

In  Chapter  4 we  develop  iteration  algorithms  to  solve 
the  "Initial  Allocation"  problem  in  a decentralized  manner. 
These  algorithms  have  proveable  convergence  properties,  and 
quadratic  convergence  rates.  Again,  although  decentralized 
iteration  algorithms  have  been  described  in  the  literature, 
their  convergence  has  required  strict  conditions  on  the 
functions  (convexity,  continuity).  We  are  able  to  extend 
our  proofs  to  more  general  functions.  A miner  but 
interesting  contribution  of  this  Chapter  is  the  Selection 
Algorithm.  This  is  a simple  and  intuitively  appealing 
method  of  solving  certain  inequality  problems,  yet  its 
solution  possesses  useful  minimum-norm  properties. 

In  Chapter  5 we  describe  the  design  of  a practical 
Resource  Management  system,  and  give  examples  from  the 
system  designed  for  the  FIAT  warehouse.  Chapter  6 
illustrates  the  applicability  of  our  methods  to  the  problem 
of  optimal  file  allocation  in  distributed  computer  systems. 


Our  conclusions,  presented  in  Chapter  7,  advocate  a new 
approach  to  Large-Scale  systems.  Based  on  the  methods  used 
in  this  work,  we  suggest  that  analysis  of  such  systems 
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should  depend  on  the  "global"  properties  of  the  systems,  and 
be  insensitive  to  the  "local"  properties  of  the  systems. 
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CHAPTER  1 
INTRODUCTION 

In  an  era  where,  due  to  rapid  advances  in  technology, 
we  are  seeing  greater  and  greater  interconnection  between 
systems,  the  study  of  large-scale  systems  is  assuming  a new 
importance.  Along  with  this  has  come  the  realization  that 
in  most  applications  practicality  calls  for  decentralized 
control  of  such  systems.  The  importance  of  these  two  facts 

is  reflected  by  the  recent  devotion  of  an  entire  issue  of  a 

prominent  journal*  to  the  topic  of  "Large-Scale  Systems  and 

Decentralized  Control".  In  this  work  we  study  the 

application  of  decentralization  to  one  aspect  of  such 
systems,  namely,  the  problem  of  Resource  Management  in  Large 
Systems. 

In  a large  operational  system,  where  a very  large 
number  of  activities  share  a number  of  limited  resources, 
this  Resource  Management  problem  has  three  main  objectives. 
The  first  (the  "Initial  Allocation"  or  "Design"  problem)  is 
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to  find  an  assignment  of  resources  to  every  activity,  3uch 
that  all  the  system  constraints  are  satisfied,  and  all 
activities  are  operating,  enabling  the  system  as  a whole  to 
operate.  The  second  (the  "New-Assignment"  problfi)  is  to 
find  a rationale  for  allocating  resources  to  new  activities. 
It  is  presumed  that  new  activities  are  initiated  frequently 
enough  that  we  do  not  wish  to  re-solve  the  entire  problem 
for  the  combined  set  of  old  and  new  activities.  The  third 
objective  ("Periodic  Review"  problem)  is  to  find  an 
efficient  way  of  re-allocating  resources  in  order  to  reflect 
the  changing  needs  of  the  individual  activities,  as  well  as 
the  changes  in  total  resource  usages. 

Conventionally,  the  resource-allocation  problem  has 
been  studied  for  the  case  where,  in  addition  to  the 
constraints,  there  exists  an  objective  to  be  maximized.  Our 
emphasis,  as  is  reflected  by  the  title  of  this  work,  is  on 
the  feasibility  aspect  of  the  problem,  that  is,  of  taking  a 
large  system  and  keeping  it  operational  (maintaining  it  in 
the  feasible  region).  We  shall  see  that  this  in  itself  is 
both  an  important  problem,  and  has  theoretically  interesting 
consequences.  In  addition,  our  results  can  be  useful  for 
the  solution  of  the  general  (optimization)  problem. 


In  Chapter  2 we  shall  provide  the  motivation  for  our 
study.  The  Chapter  is  divided  into  two  Parts.  Part  I 
describes  the  Resource  Management  problem  in  a large  FIAT 
spare-parts  warehouse.  It  was  the  tackling  of  this  problem 
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that  led  to  the  work  presented  here.  Hence  an  appreciation 
of  the  original  problem  leads  to  a deeper  understanding  of 
the  abilities  of  our  solution  technique.  The  resource 
management  problem  in  the  warehouse  is  exactly  the  general 
one  described  above,  with  the  three  objectives  corresponding 
to  (i)  Initial  Allocation  of  all  parts,  (ii)  Allocation  of 
new  parts,  and  (iii)  Periodic  Reallocation  of  parts.  We 
discuss  in  detail  why  the  warehouse  management  is  satisfied 
with  a feasible  solution,  and  do  not  require  any  notion  of 
optimality.  The  size  of  the  problem  (there  are  60,000 
part -numbers , and  30-50  new  part-numbers  are  to  be  allocated 
every  day)  , and  the  fact  that  the  resource  usage  functions 
are  discontinuous  and/or  nonlinear  and/or  nonconvex,  make 
the  problem  a hard  one.  We  discuss  why  standard  techniques 
(Linear  Programming,  Integer  Programming)  are  not  suitable. 

Part  II  of  Chapter  2 advocates  a decentralized  approach 
to  the  problem.  We  show  how,  by  replacing  the  feasibility 
problem  by  a suitable  "Artificial”  optimization  problem,  we 
can  use  Lagrange  Multipliers  to  simplify  the  solution 
through  decentralization  of  decisions.  We  then  show  how 
this  decentralized  approach  offers  substantial  advantages 
over  standard  techniques,  with  regard  to  each  of  the  three 
objectives  above.  This  provides  us  with  the  basic 


motivation  for  further  investigation  of  this  approach. 
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Although  the  use  of  Lagrange  Multipliers  for 
decomposition  of  large  problems  is  well  known,  the  main 
drawback  of  this  technique  is  the  existence  of  "duality 
gaps".  This  means  there  may  not  exist  multipliers  which  can 
generate  the  optimum  to  the  Artificial  Problem  above.  In 
Chapter  3 we  present  a Theorem  giving  simple  conditions 
which  guarantee  the  existence  of  optimal  multipliers.  Such 
results  have  previously  been  given  only  under  strict 
conditions  (e.g.  convexity,  continuity).  We  are  able  to 
completely  remove  restrictions  on  the  form  of  the  individual 
resource  usage  functions,  replacing  them  instead  by  a 
condition  based  on  their  magnitudes  alone.  This  result 
gives  a firm  theoretical  basis  to  our  approach,  since  it 
justifies  the  use  of  our  technique  in  many  cases  where  the 
resource  usage  functions  are  ill-behaved  and  the  domains  of 
the  decision  variables  are  nonconvex. 

In  Chapter  4 we  develop  iteration  algorithms  to  solve 
the  "Initial  Allocation"  problem  in  a decentralized  manner. 
These  algorithms  have  proveable  convergence  properties,  and 
quadratic  convergence  rates.  Again,  although  decentralized 
iteration  algorithms  have  been  described  in  the  literature, 
their  convergence  has  required  strict  conditions  on  the 
functions  (convexity,  continuity).  We  are  able  to  extend 
our  proofs  to  more  general  functions.  A minor  but 
interesting  contribution  of  this  Chapter  is  the  Selection 
Algorithm.  This  is  a simple  and  intuitively  appealing 


method  of  solving  certain  inequality  problems,  yet  its 
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solution  possesses  useful  minimum-norm  properties. 

In  Chapter  5 we  describe  the  design  of  a practical 
Resource  Management  system,  and  give  examples  from  the 
system  designed  for  the  FIAT  warehouse.  We  show  how,  in  an 
operational  system,  our  approach  enables  simple  and 
efficient  solution  of  both  the  " Ne w-ass ignment"  problem  and 
the  "Periodic  Review"  problem.  In  addition,  we  suggest  how 
the  algorithms  in  Chapter  4 can  be  enhanced  by  incorporating 
various  features  useful  in  a practical  application. 

Chapter  6 illustrates  the  applicability  of  our  methods 
to  the  problem  of  optimal  file  allocation  in  distributed 
computer  systems.  This  problem  has  only  been  addressed  in 
the  literature  for  a small  number  of  files  (typically  5 to 
20).  We  show  how,  by  appropriate  formulation,  of  the 
problem,  a decentralized  solution  is  possible.  This  brings 
previously  intractable  problems  (w„th  several  thousand 
files,  say)  within  the  reach  of  known  solution  methods.  Our 
aim  is  to  demonstrate  the  applicability  and  advantages  of 
decentralized  techniques  --  for  the  general  optimization 
problem  we  do  not  go  into  details  of  solution  algorithms, 
but  give  a framework  for  future  research.  We  do  however 
consider  one  case  in  detail  --  this  is  the  problem  faced  by 
a "Network  Manager",  whose  task  is  to  keep  a given  network 
operational  (that  is,  all  resource  usages  within  the 
constraints)  in  the  face  of  constant  arrivals  of  new  files 
and  changing  characteristics  of  old  files. 


Finally,  in  Chapter  7,  we  present  our 


conclud 


remarks 
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CHAPTER  2 

MOTIVATION  FOR  THIS  RESEARCH 


Part  I : Resource  Management  in  a Large  Wa rehouse 

2.1  INTRODUCTION  TO  PART  I 

In  unis  Part  we  motivate  the  reader  by  describing  the 
resource  management  problem  in  a large  warehouse.  It  was 
the  tackling  of  this  problem  that  led  to  the  work  presented 
in  this  study.  Hence  an  appreciation  of  the  original 
problem  leads  to  a deeper  understanding  of  the  abilities  and 
advantages  of  our  approach. 

Ofcourse,  the  application  of  our  work  is  not  restricted 
to  the  specific  problem  in  this  particular  warehouse.  The 
reader  will  see  that  our  models  and  methods  can  be  applied 
to  a variety  of  large  systems  which  face  similar  problems. 
In  Chapter  6,  for  instance,  we  will  discuss  how  our  approach 
could  be  applied  to  the  File  Allocation  Problem  in  large 
distributed  computer  systems. 


* 
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2.2  DESCRIPTION  OF  WAREHOUSE 

Our  interest  in  this  problem  arises  From  a project 
involving  the  author,  along  with  a team  from  CSDL*,  to 
improve  the  operation  of  the  FIAT  Central  Spare  Parts 
Warehouse,  in  Volvera  (Turin,  Italy).  This  Warehouse 
essentially  supplies  spare  parts  to  the  whole  world.  It 
covers  an  area  exceeding  that  of  15  football  fields,  has  an 
inventory  of  over  20,000  tons  (valued  at  several  hundred 
million  dollars),  contains  more  than  60,000  different 
Part-Numbers  (each  of  which  may  occupy  several  containers), 
and  services  about  10,000  orders  every  day  [FI]. 

The  Warehouse  is  divided  into  several  different  areas, 
used  for  stocking  Parts  with  different  characteristics.  For 
instance,  medium-sized  items  with  not  too  high  demand  are 
stocked  in  a "High  Shelf  Area",  where  loading  and  retrieval 
of  containers  is  done  solely  by  computer-controlled  cranes. 


On 

the  other 

hand  , 

very  small,  fast-moving 

items 

are  stored 

in 

an  area 

where 

they  are  hand-picked 

by 

men  with 

hand 

-pushed 

carts 

. Figs. 2-1  to  2-3 

illustrate  the 

characteristics  of  three  such  areas. 


•The  Charles  Stark  Draper  Laboratory,  Cambridge,  Mass. 
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The  "High  Shelf" 
Area 


This  area  is  equipped  with 
very  high  (27  metre)  shelves, 
occupying  a floor  area  of 
144x96  metres,  with  24 
corridors  served  by  automated 
cranes.  The  cranes  are  used 
for  loading  and  retrieval  of 
containers.  Picking  of  parts 
from  this  area  is  done  either 
by  placing  the  containers  on 
conveyors,  leading  to  manual 
picking  stations,  or  by 
retrieving  individual 
containers  down  to  manned 
"bays"  at  the  floor  level. 
Operation  of  all  the  equipment 
in  the  plant  is  totally 
automatic  and  is  controlled  in 
real  time  by  a computer.  The 
storage  capacity  of  this  area 
is  over  120,000  containers, 
and  the  picking  capacity  i3 
over  5,000  picks  per  day.  The 
area  is  designed  for  material 
of  average  dimension,  and  of 
medium  or  high  turnover  and 
withdrawal  frequency. 

(Source:  [FI]) 
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These  shelves  house  a few  hundred  high- turnover  panel  items. 
Since  the  material  has  undergone  protective  treatment,  it  is 
channelled  in  one  direction  only,  to  avoid  getting  damaged. 
Withdrawal  is  thus  on  a first-in,  first-out  basis.  Each 
corridor  serves  one  item  only,  and  the  crates  progress  from 
the  loading  end  to  the  withdrawal  end  by  means  of  robots. 
These  shelves  have  enormous  capacity,  holding  more  than 
30,000  large  crates.  (Source:  (FI]) 
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Flg . 2-3 : Compact  Warehouse 


The  storage  volume  of  these 
large  shelves  (about  50,000 
cubic  metres)  enables  6,000 
tonnes  of  particularly  large, 
low-turnover  material  to  be 
stocked.  The  shelves  run  on 
rails  so  that  corridors  are 
created  on  request  only  at  the 
moment  when  the  fork-lift 
trucks  need  to  load  or 
withdraw  containers.  (A 
safety  system  stops  operations 
whenever  persons  or  material 
are  located  in  the  temporary 
corridors.)  Source:  [FI]. 
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The  servicing  of  daily  orders,  and  the  replenishment  of 
stocks,  makes  use  of  various  resources  in  each  area,  such  as 
cranes,  conveyors,  men,  and  shelf  space.  These  resources 
may  be  particular  to  a given  area  (such  as  shelf  space)  or 
may  be  shared  by  several  areas  (such  as  a conveyor  that 
passes  through  different  areas:  see  Fig. 2-4). 


Fig. 2-4:  Examples  of  Various  Resources 
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Naturally,  these  resources  have  limits  on  their  capacity. 
Upon  reviewing  the  situation  at  the  warehouse,  in  January 
1977,  we  found  the  salient  characteristics  of  the  resource 
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management  problem  to  be  as  follows: 

1.  There  are  several  different  storage  areas,  each 
with  several  container-types , leading  to  1 6 
diff erent  storage-types* . 

2.  Each  storage-type  uses  several  resources,  some  of 
which  are  shared  with  other  storage-types . there 
are  con  strained  resour  ces*  , such  as: 

1.  Storage  Capacity 

2.  Crane  Capacity 

3.  Conveyor  Capacity 

4.  Manual  Picking  Capacity 

3.  There  were  60 , 000  Part-Numbers*  assigned  to  the 
various  storage-types  on  the  basis  of  criteria  that 
were  long  since  outdated  --  demand  patterns  and  the 
Warehouse  operations  had  changed  considerably. 

The  net  effect  of  these  factors  was  bottlenecks  in 
several  resources,  yet  much  spare  capacity  in  others.  This 
meant  that  while  in  some  storage-types  the  daily  demand  (or 


•The  figures  given  here  correspond  to  those  areas  of  the 
warehouse  within  the  scope  of  our  project. 
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storage  requi reme.nts ) could  not  be  met,  in  other 
storage- types  equipment  was  lying  idle**.  Keeping  in  mind 
these  problems,  as  well  as  the  future  operating  requirements 
of  the  warehouse,  the  aims  of  our  'project  were  set  down  as: 

1.  "Get  rid  of  the  bottlenecks".  (Improve  the  current 
allocation  as  quick  as  possible.) 

2.  Develop  a method  for  reviewing  the  situation  (say) 
every  3 months,  and  making  necessary  reallocations 
(Periodic  Review) . 

3.  Develop  a rationale  for  allocating  storage  to  New 
Part-Numbers  (note  that  these  are  not  replenishment 
stocks  for  existing  Parts,  but  Parts  never  before 
stocked,  e.g.  for  a new  car  model). 


2.3  FORMAL  STATEMENT  OF  PROBLEM 

In  this  section  we  develop  a formal  model  of  the 
Storage  Allocation  and  Resource  Management  problem,  and 
indicate  the  factors  that  make  a good  solution  (i.e.  one 
that  fulfills  the  aims  above)  difficult  to  find.  Although 
we  will  state  our  model  in  terms  of  the  Warehouse  above,  the 
reaier  will  see  that  our  model  generalizes  to  other  large 

••Due  to  the  special  design  or  fixed  nature  of  the  equipment 
in  each  area,  it  is  not  generally  possible  to  transfer 
capacity  from  one  area  to  another. 
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systems  (see  Chapter  3,  Chapter  6). 


2.3.1  Notation 

It  is  assumed  that  the  reader  has  read  the  description 
of  our  notational  conventions  (following  the  Table  of 
Contents).  This  will  eliminate  the  need  for  minor 
definitions  throughout  the  Chapter. 


2.3-2  Problem  Formulation 

Let  there  be  I Items  (Part-Numbers)  to  be  allocated  in 
S Stor age-types , such  that  Ft  Resource-usage  constraints  are 

satisfied.  To  facilitate  recall,  we  will  use  the  indices  i, 

^ h t h t*  h 

s,  or  r,  to  represent  the  i item,  s storage-type , or  r~‘ 

resource . 


2.3*2.  1 Item  Allocation  - The  total  quantity  of  item  i is 

Q1 , and  its  other  characteristics  (demand,  weight,  volume, 

etc.)  are  represented  by  a data  vector  d 1 . For  each  item  a 

S-dimensional  decision  x1  needs  to  be  taken,  where  x1  is  the 

— s 

quantity  of  item  i allocated  to  storage  s.  We  will  often 
refer  to  x1  as  an  allocation  of  item  i. 
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2. 3. 2. 2 Resource  Usage  - A given  allocation  for  an  item, 
along  with  the  item's  data  characteristics  (as  above)  will 
result  in  the  use  of  various  resources  (e.g.  storage  space, 
crane-time,  etc.).  We  define  the  resource  Usage  function 
u1(d1,x^1)  to  be  an  R-dimensional  vector  function  such  that 
u^(di,xi)  is  the  usage  of  the  rth  resource  by  an  item  with 
data  d 1 , when  its  allocation  is  x_ 1 . (The  calculation  of 
u(  . , . ) obviously  depends  on  the  "operating  rules"  of  the 
Warehouse.  For  generality,  we  let  these  rules  be  defined 
for  different  items,  hence  the  superscript  i on  u above.) 


2. 3- 2. 3 Total  Allocation  And  Total  Usages  - The  allocation 
of  all  items  will  be  represented  by  the  vector 

x 5 [ (xV,(x2)' (x1)’  ]' 

The  total  resource  usage  by  an  allocation  of  all  items  is 

(2.3.1]  u(  x)  £ JT  u^d1  .x1) 

iTl 

We  will  often  refer  to  u or  u1  as  "usage  vectors". 

2. 3-  2. 4 Constraints  On  Usages  - The  R-dimensional  vector  of 
constraints  on  the  resource  usages  will  be  denoted  by  c, 
that  is  cr=  value  of  constraint  on  usage  of  resource  r. 
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2. 3-2. 5 Statement  Of  General  Problem  - Let  e be  the 
S-dimensional  vector  with  each  component  equal  to  unity, 
i.e.  e = [ 1 , 1 , . . . , 1 ] ' . Then  the  Storage  Allocation  and 
Resource  Management  Problem  can  be  stated  as  the  General 
Problem 


[2.3-2]  (GP):  Find  x = [ ( x 1 ) • (x1)’]' 


■ » 

such  that 

e ' x1 

= 

Ql 

(I 

equations) 

and 

x1 

> 

0 

(S 

x I equations) 

and 

u(  x) 

< 

c 

(R 

equations ) 

Note  that  the  decision  x 

consists 

of  S 

x I 

components . 

2.3-3  Comment  On  Feasibility  Versus  Optimality 

The  astute  reader  will  already  have  noticed  that  the 
problem  (GP),  as  formulated,  only  involves  looking  for  a 
feasible  solution;  no  notion  of  optimality  has  been  stated. 
Some  comment's  on  the  reasons  for  this  are  in  order  here. 

The  first  reason  for  looking  only  for  a feasible 
solution  is  that  the  problem  is  so  complex  (see  next 
section)  that  even  a feasible  solution  is  hard  to  find. 
Thus  we  are  satisfied  if  we  can  generate  such  a solution. 
The  second,  more  satisfactory,  reason  derives  from  the 
warehouse  management's  objectives,  which  are:  to  keep  the 
warehouse  operational,  irrespective  of  the  relative  uses  of 
the  resources,  provided  these  usage  levels  are  within  the 
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limits  laid  down  by  management.*  From  an  economic  point  of 
view  too,  these  objectives  have  an  explanation:  the  major 
warehouse-equipment  has  already  been  installed,  and  the 
capacities  are  non-transf erable  (sec. 2. 2).  Further,  the 
day-to-day  operating  cost  of  the  warehouse  is  relatively 
indifferent  to  what  equipment  is  being  used.  Thus  there  is 
no  clear  tradeoff  between  (say)  using  10  more  minutes  of 
crane-time  in  one  area  versus  using  2 more  containers  of 
storage  in  another.  Hence  no  criterion  for  minimization  can 
be  stated,  and  all  feasible  solutions  are  equally  palatable. 

The  fact  that  we  look  only  for  feasible  solutions  does 
not  restrict  the  applicability  of  our  approach  to  the 
special  problem  above.  In  Chapter  3 we  give  examples  of  how 
our  method  can  be  used  in  several  types  of  optimization 
appl ications . 


2.4  FACTORS  CONTRIBUTING  TO  COMPLEXITY  OF  PROBLEM 

Several  factors  make  (GP)  a complex  problem,  not 
amenable  to  standard  techniques.  These  factors  will  now  be 
discussed . 


*We  are  assuming  that  these  limits  already  include  some 
margin  of  safety.  This  point  will  be  discussed  further  in 
Chapter  5,  when  we  study  the  practical  aspects  of  our 
system . 


2.4.1  Immense  Si ze 


In  the  warehouse  we  have  1=60,000,  S = 1 6 , and  R=24. 
This  leads  to  a decision  vector  of  approximately  one  million 
components ! 


2.4.2  Part-Data  And  Usage  Functions 

The  diversity  of  Part-Data  (frequency  of  demand, 
quantity  demanded,  weight,  volume,  etc.)  and  the  dependence 


of  the  usage 

functions 

on  the 

physical  operation  of 

the 

warehouse  , 

leads  to 

usage 

functions  which  can 

be 

d iscontinuous 

and/or 

nonl inear 

and/or  nonconvex  . 

For 

example  if  a quantity  Q of  an  item  is  to  be  "picked”,  and 
fits  entirely  in  one  container,  then  one  crane  operation  can 
retrieve  the  container.  However,  for  larger  Q two 
containers  may  need  to  be  retrieved,  requiring  two  crane 
operations.  Thus  crane-time  usage  can  be  a discontinuous 
function  of  the  quantity  demanded. 


2.4.3  Incoming  New  Part-Numbers 

In  addition  to  the  60,000  items  in  the  warehouse,  there 
are  30-50  New  items  arriving  every  day.  As  mentioned  in 
sec. 2. 2,  these  are  not  replenishment  stocks,  but  entirely 
new  items.  Hence  30-50  new  allocations  x 1 have  to  be  made 
every  day,  and  clearly  we  would  like  to  make  "reasonable" 
decisions  without  re-solving  the  whole  problem  (GP)  for  the 
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comb  ined  set  of  old  and  new  items.  By  "reasonable"  we  mean 
a decision  that  will  remain  valid  in  the  long  run,  that  is 
we  should  not  have  to  re-allocate  these  New  Parts  in  the 
near  future. 


2.4.4  Shortcomings  Of  Standard  Techniques 

Initially,  we  might  have  been  tempted  to  try  Linear 
Programming  or  Integer  Programming  techniques,  using 
linearized  approximations  where  necessary.  In  view  of  the 
above  remarks  we  see  that  these  would  suffer  from  other 
major  disadvantages:  first,  the  decision  vector  of  one 
million  components  would  lead  to  an  astronomical  program; 
and  second,  these  methods  would  not  lead  to  any  strategy  for 
allocating  the  new  parts,  short  of  re-solving  the  problem 
for  each  new  set  of  parts. 

In  the  second  part  of  this  Chapter  we  shall  see  how  an 
appropriate  re  fortnul  at  ion  of  the  problem  (GP)  leads  us  to 
better  solution  tools. 
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Part  II: 


Decentral ized  Solution  Techniques 


- 


2.5  INTRODUCTION  TO  PART  II 

In  the  preceding  sections  we  described  the  Resource 
Management  problem  in  a Large  Warehouse,  discussed  the 
reasons  which  make  it  a hard  problem,  and  stated  why  we  are 
satisfied  with  any  feasible  solution  to  the  problem.  In 
this  Part  we  develop  the  beginnings  of  a solution  technique 
for  the  problem.  Our  method  is  to  set  up  an  "Artificial" 
optimization  problem,  and  to  use  Lagrange  Multipliers  to 
simplify  the  solution  to  this  problem,  thr.ough  decomposition 
into  a large  number  of  smaller  (much  easier)  problems.  We 
will  show  that  solving  these  smaller  problems  also  gives  an 
aoproach  for  the  New  Parts  problem  (sec.2.4.3).  Then  we 
will  describe  conventional  iteration  methods  which,  using 
the  smaller  problems,  might  be  able  to  solve  the  Resource 
Management  problem. 

The  use  of  Lagrange  Multipliers  for  decomposition  of 
large  problems  is  well  known  (see  discussion  in  sec. 2. 7). 
In  general  however,  such  methods  suffer  from  some  severe 
d isad vantages , the  most  devastating  of  these  being  the 
possibility  of  "duality  gaps".  This  means  that  there  may 
not  exist  multipliers  which  generate  an  optimum  to  the 
Artificial  problem.  A detailed  discussion  of  this  and  other 
disadvantages  is  given  at  the  end  of  sec. 2. 7. 
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In  the  remainder  of  this  Chapter  we  show  the  reader  the 
advantages  of  the  decentral ized  approach.  This  will 
motivate  us  to  investigate  whether  we  can  overcome  the 
disadvantages  mentioned  above.  It  shall  be  that  very 
investigation  whicn,  in  Chapters  3 and  4,  will  lead  to  • the 
main  contributions  of  this  work. 

2.6  THE  ARTIFICIAL  PROBLEM  AND  LAGRANGE  MULTIPLIERS 

2.6.1  The  Artificial  Problem 

In  order  to  put  the  problem  (GP)  [2.3.2]  in 
conventional  optimization  terms  we  formulate  the 
"Artificial"  Problem 

[2.6.1]  (AP):  max  J ( x)  = e:xl 

iTi 

[2.6.2]  subject  to  0 each  i 

[2.6.3]  Q*-e'x^>  0 each  i 

[2.6.4]  c-u(x)>0 
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feasible  solution  exists  to  tne  original  problem,  then  the 
optimal  value  of  the  Artificial  Problem  is  zero.  This  point 
will  be' amplified  in  Chapter  3-) 

2.6.2  Lagrange  Multiplier  Method 

We  tackle  (AP)  by  the  familiar  Lagrange  Multiplier 
method.  Let  X be  a R-dimensional  vector  of  Lagrange 
Multipliers.  We  write  the  Lagrangean  associated  with  (AP) 
as 

[2.6.5]  L(x,A)  = J(x)  - A’[u(x)-c] 

?'or  each  i,  let  X1  be  the  set  of  x.1  which  satisfy  [2.6.2] 
and  [2.6.3],  and  let  X be  the  set  of  x such  that  x^e  X*  for 
each  i.  Then  there  is  the  following  "Saddle  Point  Theorem" 
(see  for  example  Lasdon  [LI]): 

If  there  exist  (x*,A*)  with  x*eX  and  A*^  0 such  that 

[2.6.6]  L(x,A«)  < L ( x*  ,X * ) < L(x*,X) 

for  all  x6X  and  X >_  0,  then  x*  solves  the  problem  (AP). 

[] 

The  power  of  the  above  result  lies  in  the  fact  that  it 
does  not  depend  on  the  form  of  the  functions  J(x^)  and  u(jO  , 
nor  on  the  form  of  the  set  X [LI].  An  alternative  view  of 

[2.6.6] ,  which  will  be  used  below,  is  to  say  that 

[2.6.7]  x*  = arg  max  L(x:,A*) 

xex 

X*  = arg  min  L(x*,X) 

X >0 


[2.6.8] 
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2.6.3  Decentralization  Of  Decisions 

A key  point  to  note  is  that  for  given  X the  problem 
[2.6.7]  can  be  decentralized  since 

I .... 

[2.6.93  max  L(x,X)  = X'c  max  . { e ' x 1 -X  ' u1  ( d 1 , x 1 ) } 

x6X  £T- 1 jteX1 

Thus,  for  given  X,  the  decision  for  each  item  i can  be  taken 
independently  of  the  others,  by  solving  the  (much  simpler) 
Individual  Problem 

[2.6.10]  (IP):  rpax.  { e ' x i-X  ' u1  ( d 1 , x 1 ) } 

x ex 

We  note  here  that  in  Chapter  4 we  shall  further 
simplify  (IP).  In  Chapter  5 we  will  show  how  (for  the 
warehouse  case)  we  were  able  to  find  an  easy  solution  to 
this  individual  problem,  and  also  show  that  solving  (IP) 
gives  us  an  approach  for  the  New  Parts  problem. 


2.7  KNOWN  SCHEMES  FOR  FINDING  OPTIMAL  MULTIPLIERS 

We  see  above  that  a given  X,  through  (IP),  leads  to  an 
allocation  of  all  items,  say  £(X) , and  corresponding  total 
resource  usages  u(jt(X)).  We  can  therefore  think  of  u as  a 
function  of  X,  and  we  shall  henceforth  write  it  simply  as 
u(X) . The  problem  then,  is  to  find  the  X*  in  [2.6.8],  for 
then  from  (IP),  [2.6.7],  and  [2.6.6]  we  know  that  £(X*)  and 
u(X*)  are  optimal.* 

•For  the  moment  we  assume  such  a X*  exists.  This  issue  will 
be  triefly  discussed  in  the  next  section,  and  answered  in 
detail  in  Chapter  3. 
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Arrow  and  Hurwicz[A1]  observed  that  [2.6.7]  and  [2.6.8] 


suggest  an 

iterative 

sc  heme 

of  the  form 

[2.7.  1] 

k+1 
x = 

arg  max 

L(x,Xk) 

xex 

[2.7.2] 

xk+1  = 

arg  min 

. / k+1  . . 
LU  ,X ) 

X >0 


They  noted  that  [2.7.1]  can  be  solved  using  (IP)  with  given 
X=X  , and  an  alternative  to  solving  [2.7.2]  exactly  at  each 
iteration  is  to  let  X be  changed  in  the  direction  of  the 
negative  gradient  of  L with  respect  to  X,  which  is  then 
simply 


[2.7.3] 


. k+1 

h 


= < 


Xk  if  Xk  = 0 and  u .(Xk)<c  . 

J J J J 

Ak  + a . { u . (Xk ) -c . } otherwise 
J J J J 


where  a^  is  some  constant.  The  scheme  [2. 7. 1 ] , [2. 7. 3]  then 
has  an  intuitively  appealing  economic  interpretation.  A 
"central  co-ordinator"  chooses  a set  of  "prices"  X,  after 
which  the  items  i find  their  optimal  decisions  x1  for  this 
X..  The  central  co-ordinator  then  looks  at  the  total 
resource  usages  and  adjusts  the  prices  to  increase  the  cost 
of  over-used  resources,  and  decrease  the  cost  of  under-used 
resources  (but  never  making  any  cost  negative);  in  other 
words  he  adjusts  prices  according  to  excess  demand.  This 
use  of  decentralization  in  Resource  Allocation  problems  is 
well  known  [ A 1 , E 1 , G 1 , L 1 , S2 ] , and  arises  out  of  the  additive 
nature  of  the  objective  function  [2.6.1]  and  the  resource 
usage  functions  (see  [2.6.4]  along  with  [2.3.1]). 
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We  have  reduced  via  this  means  an  optimization  problem 
involving  S x I (=one  million)  variables  to  an  optimization 
problem  with  R (=24)  variables  plus  a set  of  I (=60,000) 
decoupled  and  relatively  simple  problems.  However,  we  must 
overcome  three  additional  difficulties: 

1.  The  decomposition  and  iteration  method  described 
above  falls  in  the  general  category  of  "dual" 
methods  CGI].  A major  shortcoming  of  these  methods 
is  the  existence  of  "duality  gaps"  [G3,L1]  --  this 
means  that  although  an  optimal  value  of  the 
Artificial  Problem  exists,  no  pair  (j(*,A*)  exists 
which  satisfies  the  Saddle  Point  Condition  [2.6.6]. 
Thus,  no  A can  achieve  the  optimum  value  using  the 
schemes  given  above.  (The  name  "duality  gap" 
derives  from  the  fact  that  there  is  a gap  between 
the  actual  maximum  and  the  maximum  achievable  using 
the  dual  method.) 

2.  Even  if  no  duality  gap  exists,  convergence  of 
[2.7.3]  is  guaranteed  only  when  strict  conditions 
(such  as  convexity/continuity)  hold  on  the  Payoff 
Function  and  Resource  usage  Functions  C A 1 , Z 1 ] -- 
conditions  which  certainly  do  not  hold  in  our 
problem . 

3.  Convergence  can  be  very  slow  even  given  the  above 


cond it  ion  s . 
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We  therefore  look  for  an  improved  scheme.  At  this  stage  the 
reader  might  well  ask  what  motivated  us  to  pursue  this  line 
of  attack  in  the  face  of  such  difficulties,  so  we  pause  a 
moment  to  gather  our  thoughts  on  this  issue. 


2.8  ADVANTAGES  OF  DECENTRALIZED  APPROACH 

The  reasons  why  we  chose  to  pursue  this  solution 
technique  are  manifold: 

1.  It  makes  possible  the  solution  of  a large 
intractable  problem,  by  reducing  it  to  a number  of 
smaller  problems. 

2.  Suppose  we  are  able  to  find  an  efficient  iteration 
technique,  and  use  it  to  generate  a solution  A*, 
with  corresponding  allocation  j<(A*).  When  demand 
characteristics  have  changed  slightly  over  some 
months,  we  still  expect  A*  to  be  a good  starting 
point  for  iterations  to  find  a new  solution.  Hence 
the  Periodic  Review  problem  (sec. 2. 2)  can  be  solved 
very  efficiently  each  time. 

3.  Given  a set  of  multipliers  A*,  the  New  Parts 
problem  (sec.2.4.3)  can  be  reduced  to  solving  (IP) 
for  each  new  part.  This  problem  is  relatively  easy 
and  can  be  solved  separately  for  each  new  patl. 
Hence  the  allocation  of  new  parts  is  (through  A*) 
made  independent  of  the  rest  of  the  parts  in  the 


-28- 


warehouse.  (The  practical  implementation  of  this 
scheme  demands  some  care,  and  is  described  in 
Chapter  5 . ) 

4.  The  economic  interpretation  of  the  scheme  (sec. 2. 7) 
makes  it  appealing  to  Managers,  who  readily 
understand  it.  Hence  they  prefer  it  to  other 
schemes  which  give  them  no  insight  as  to  the 
rationale  behind  a particular  allocation. 


Thus,  encouraged  by  the  positive  aspects  of  the 
decentralized  approach,  we  proceed  to  resolve  the  problems 
mentioned  in  the  preceding  section.  This  will  be  the 
subject  of  the  next  two  Chapters. 
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CHAPTER  3 

AN  EXISTENCE  THEOREM  FOR  OPTIMAL  MULTIPLIERS 

NOTE 

The  results  in  this  Chapter 
are  of  interest  in  their  own 
right.  For  this  reason  it  is 
written  so  that  it  can  be  read 
independently  of  the  other 
Chapters  . 

3.1  INTRODUCTION 

3.1.1  Motivation 

The  task  of  finding  feasible  solutions  to  large 
allocation  problems  can  often  be  quite  complex.  Such  a task 
occurs  in  many  situations  (see  sec.3.1.3).  In  this  work  we 
present  a technique  for  accomplishing  this  task.  In  the 
domain  of  Linear  Programming,  the  artificial-variable  method 
is  used  for  finding  initial  feasible  solutions,  by  setting 
up  an  Artificial  Optimization  problem.  Analogously,  our 
technique  is  to  set  up  an  Artificial  Problem,  and  use 


Lagrange  Multipliers  to  simplify  the  solution  to  this 
problem,  through  decomposition  into  several  easier  problems. 
This  effectively  reduces  the  allocation  problem  to  one  of 
finding  suitable  multipliers. 

The  contribution  of  this  Chapter  is  to  give  a firm 
theoretical  basis  for  our  technique.  The  main  theorem  gives 
simple  conditions  wh ich  guarantee  the  ex istence  of  optimal 
multipliers  for  the  Artificial  Problem.  Although  the  use  of 
Lagrange  Multipliers  for  allocation  problems  is  well  known, 
such  existence  results  have  previously  been  given  only  under 


strict  conditions 

(e.g  . 

convexity,  continuity). 

Our 

Theorem  justifies 

the 

use  of  this 

technique  in  many 

cases 

where  the  resource 

usage 

f unctions 

are  nonconvex 

and/or 

discontinuous,  and  the  domains  of  the  decision  variables  are 
nonconvex . 

An  iterative  algorithm  for  solving  the  Artificial 
Problem,  using  our  technique,  is  described  in  the  next 
Chapter.  In  that  Chapter  we  also  show  that  this  algorithm 
has  proveable  convergence  properties,  and  a quadratic  rate 
of  convergence.  It  is  thus  expected  to  be  of  practical 
value;  indeed,  in  Chapter  5 we  describe  one  instance  of  its 
use  in  a large  warehouse. 
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3.1.2  Abstract  Statement  Of  Problem 

In  Chapter  2 we  described  the  Resource  Management 
problem  in  a very  large  FIAT  spare-parts  warehouse.  In  this 
section  we  state  the  problem  in  abstract  terms,  so  that  the 
reader  can  appreciate  the  generality  of  our  work. 

Consider  the  problem  of  allocating  several  resources 
among  a set  of  independent  activities.  Specifically,  let 


A1,  i e { 1 , 2, . . . , 1}  be  the  ith  Activity. 

xXe  S1  be  the  strategy  (decision)  of  A1,  with 

S*-  a discrete  strategy  set  with  a 
finite  number  of  elements. 


xx  R 

u (x  ) S E be  an  R-d imensional  resource  Usage 

vector,  where  u^(xi)  is  the  amount  of 
resource  k used  By  A^  when  strategy  x* 
is  employed  (each  ui(xi)X)). 

x£[x\x2 x1]'  be  an  Allocation  (each  x*6S*). 


u(  x) = }“  ux(  X1) 
iTi 

c e er 


be  the  total  resource  Usage  by  an 
allocation . 

be  the  vector  of  Constraints  on  the 
usages  (c,  =limit  on  u3age  of  resource 
k)  * 


Typically,  we  have  in  mind  problems  where  I (the  number 
of  activities)  is  in  the  tens  of  thousands,  R (the  number  of 
resources)  is  in  the  range  10  to  50,  and  the  number  of 
elements  in  each  strategy  set  is  in  the  hundreds. 


d 
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Usual  1 y , one  has  the  following  "Optimization  Problem": 

[3-1.1]  (OP):  max  H(j<)  a real  valued  function 

x 

subject  to  x1  6 S1  for  each  i 
and*  li ( £ ) _<  c 

In  this  Chapter  we  consider  the  "Feasibility  Problem": 

[3-1.2]  (FP):  Find  x such  that 

x1  6 S1  for  each  i 
and  u ( x ) < c 

3.1.3  The  Utility  Of  The  Feasibility  Problem 

The  solution  to  (FP)  is  useful  for  a variety  of 

reasons : 

1.  In  the  case  of  the  problem  (OP),  there  may  exist 
iteration  methods  which  improve  a given  feasible 
solution.  In  this  case  a completely  different 
algorithm  may  be  required  to  generate  an  initial 
feasible  solution.  (See  [Ml]  for  an  example  of 
this . ) 


*A11  vector  inequalities  are  to  be  interpreted 
componentwise.  (See  notation  conventions  following  Table  of 
Contents . ) 


2.  In  the  case  of  an  operational  system,  it  may  be 

required  only  to  keep  i t operational , i.e.  to  find 
an  allocation  such  that  no  constraints  are 

violated,  and  to  maintain  this  condition  during  the 
operation  of  the  system.  Examples  of  this  are 
Parts-allocation  in  a large  warehouse  (see  Chapter 
2)  and  file-allocation  in  a distributed  computer 
network  (Chapter  6). 

3.  In  many  system  design  problems,  a un ique  criterion 
to  be  minimized  cannot  be  formulated.  In  such 
cases,  one  approach  is  to  define  certain 
constraints,  find  a solution  to  satisfy  these,  and 
then  look  at  the  other  performance  parameters  of 
the  resulting  design.  This  approach  has  been 
advocated  by  Chang  [Cl]  for  the  general  Distributed 
Computer  System  design  problem.  It  has  also  been 
proposed  for  solving  mul tier  iter ia  decision 
problems  [ L 3 ] - 


When  I is  very  large,  the  functions  u1(.)  are  not 
linear,  and  each  S1  contains  a substantial  number  of  points, 
(FP)  may  itself  be  a difficult  problem.*  Our  method  is  to 


*We  are  assuming  ofcourse  that,  for  each  i,  S*  does  not 
contain  an  such  that  ul()lM=Q.  For  if  this  were  the 
case,  we  could  set  xi  = Ji  for  each  i and  the  prnhlem  (FP) 
would  be  trivial.  In  the  warehouse  problem  (Chapter  2)  for 
instance,  each  part  must  be  allocated  somewhere,  so  that 
there  is  no  x i for  which  ui(xi)=0. 
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set  up  an  "Artificial"  Optimization  problem,  and  to  use 
Larange  Multipliers  to  simplify  the  solution  to  this 
problem,  through  decomposition  into  I (easier)  problems. 
The  decomposition  technique  is  well  known  [ A 1 , E 1 , G 1 , L 1 , 52 ] . 
In  general  however  such  methods  suffer  from  three 
disadvantages , which  are  briefly**:  (i)  The  existence  of 


"duality 

gaps",  which  means 

that  the 

re 

may 

not  exist 

multipliers  wnich  generate 

an 

optimum 

to 

the 

Artificial 

Problem , 

( i i ) Convergence 

of 

solution 

algorithms  requires  J 

stringent  conditions  on  the  functions  u1  ( . ) and  the  sets  S1 , 
and  (iii)  Slow  convergence  of  such  algorithms. 

In  this  Chapter  we  give  simple  conditions  which 
guarantee  the  existence  of  optimal  multipliers  for  the 
Artificial  Problem,  under  very  general  conditions  on  the 
u1  ( . ) and  S1 . A solution  algorithm  for  the  Feasible  Problem 
(above)  is  described  in  Chapter  4.  We  mention  that  this 
algorithm  has  guaranteed  convergence  properties  under 
stricter,  but  still  fairly  general  conditions  on  the  u1 ( . ) . 
It  is  being  successfully  used  in  a FIAT  spare-parts 
warehouse  with  1=60,000  , each  S1  has  16  points,  the 
functions  u1 ( . ) are  nonlinear  and/or  discontinuous,  and  the 
number  of  resources  (R)  is  24  (Chapter  5). 


**These  points  are  repeated  here  to  keep  this  Chapter 
self-contained.  See  sec. 2. 7 for  details  and  references. 
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3.2  THE  ARTIFICIAL  PROBLEM 

3.2.1  Analogy  With  Linear  Programming 

An  artificial  problem  is  used  in  Linear  Programming 
(LP)  to  generate  an  Initial  Basic  Feasible  Solution  (IBFS). 
A brief  review  of  the  technique  follows  (see  [D1]  or  [ L 5 ] 
for  details) . 

r 

Consider  an  LP  problem  whose  constraints  are: 

[3-2.1]  (LP1):  A^  = b where  A is  a matrix,  and  b >_0 

x > 0 . 

(Through  use  of  slack  variables  it  is.  always  possible  to 
express  inequality  constraints  in  this  form.)  In  order  to 
find  an  IBFS  to  (LP1)  we  consider  the  (artificial)  problem 

[3.2.2]  (LP2):  min  J~  z i 

T" 

subject  to  Ax_  + z = b 
x > 0 
z > 0 . 

Clearly,  if  a feasible  solution  exists  to  (LP1),  then  the 
minimum  value  of  (LP2)  will  be  zero,  and  conversely.  But 
now,  (LP2)  has  an  obvious  IBFS  (.x-0,  z.=  b)  , and  can  be  solved 
using  the  simplex  method.  The  optimum  to  this  problem,  if 
zero,  will  result  from  a value  of  x_  which  can  now  be  used  as 
an  IBFS  for  (LP1 ) . 


-36- 


Thus  we  see  that  the  above  technique  (called  the 
Artificial  Variable  method  in  LP)  sets  up  an  "artificial" 
problem  whose  optimum  is  any  feasible  solution  to  the 
original  problem.  This  is  essentially  our  approach  below. 


3.2.2  Formulation  Of  Artificial  Problem 

[3.2.3]  Definition ; ( Augmented  Strategy  Sets)  To  each 

strategy  set  Sx  , let  us  adjoin  a decision  9,  with 

ux(9)  £ 0 . 

We  shall  also  use 

Sx  = SXU  {9} 

„ 1 2 I 

S = S x S x . . . x S 
S = S 1 x S2  x . . . x S1 

so  that  x_9S  <=>  x19S1  for  each  i ; similarly  x_6S  + . [] 

[3-2.4]  Remark : We  can  assume,  without  loss  of  generality, 
that  90S1 . For  if  09S1  for  some  i,  then  set  xx=9,  remove  x1 
from  jc,  and  then  solve  the  remaining  problem  below.  Also 
see  footnote  in  sec. 3.  1.3.  [] 
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We  can  now  state  the  Artificial  Problem: 

[3.2.5]  (AP):  max  J (x)  = J~  p^x1) 

I iTi 

such  that  x e S 
— + 

and  u(x)  < c 


where 

px=0  if  xX  = 9 

p^  = P ^ if  x16S1  (P1  are  positive  constants). 

Now  let  J*  = J~  P*  . In  analogy  with  the  LP  example, 

1=1 

we  see  that  if  a feasible  solution  exists  to  (FP),  then  the 
maximum  value  of  (AP)  will  be  J*,  and  conversely,  any  x. 
achieving  J*  in  (AP)  must  be  a solution  to  (FP). 


3.2.3  Lagrange  Multipliers  And  Decentralized  Solutions* 

We  tackle  (AP)  by  the  familiar  Lagrange  Multiplier 
method.  Let  A be  a R-dimensional  vector  of  Lagrange 
Multipliers.  We  write  the  Lagrangean  associated  with  (AP) 
as 

[3.2.6]  L(x,A)  = J (.x)  - A'[u(j<)-c] 


*In  order  to  make  this  Chapter  self-contained,  the  next  few 
paragraphs  are  duplicated  from  Chapter  2.  There  will  be  no 
further  repetition  of  any  material  in  this  Chapter. 
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Then  there  is  the  following  "Saddle  Point  Theorem"  (see  for 
example  Lasdon  [LI]): 

If  there  exist  with  x_*eS  + and  X*>^  0 such  that 

[3.2.7]  L(x,X»)  < L ( x*  ,X  * ) < L(x*,X) 

for  all  £es+  and  X 0,  then  solves  the  'problem  (AP). 

[] 

The  power  of  the  above  result  lies  in  the  fact  that  it 
does  not  depend  on  the  form  of  the  functions  J ( jc ) and  u(£)  , 
nor  on  the  form  of  the  set  S+  [LI].  An  alternative  view  of 

[3.2.7] ,  which  will  be  used  below,  is  to  say  that 

[3.2.8]  x*  = arg  max  L(x,X*) 

x6S 
— + 

[3.2.9]  X*  = arg  min  L(x*,X) 

X >0 

We  note  further  that  for  given  X the  problem  [3-2.8]  can  be 
decentralized  since 

[3-2.10]  max  L(x,X)  = X'c+V~  max  . { p1  ( x 1 ) -X  ' u1  ( x 1 ) } 
xes  fr- 1 xxes 

Thus,  for  given  X,  the  decision  for  each  item  i can  be  taken 
independently  of  the  others,  by  solving  the  (much  simpler) 
Individual  Problem 

[3.2. 11]  (IP):  ?axi  { p1 ( x1 ) -X ' u1 ( x 1 ) } 

x es 

(This  problem  could  be  solved,  at  worst,  simply  by 
enumerating  the  points  of  sM. 

For  given  X we  shall  denote  the  solution  to  [3.2.8]  by 
2(X)  , that  is 

[3.2.12]  5 ( X ) = arg  max  L(x,X) 

x€S 
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Si  no  e the  solution  to  [3-2.12]  is  relatively  simple  for  a 
given  A,  our  aim  is  to  determine 

1.  Whether  there  exists  a A*  such  that  ^(A*)  achieves 
the  maximum  in  (AP)  ? 

2.  If  such  a A*  exists,  how  can  it  be  found  ? 


As  already  mentioned,  the  second  question  will  be 
tackled  by  us  in  Chapter  4.  In  the  rest  of  this  Chapter  we 
address  the  first  question.  Our  theorem  will  be  quite 
different  from  the  Saddle  Point  Theorem. 


3- 3 MAIN  THEOREM 

The  question  of  existence  of  an  optimal  A for  a given 
problem  has,  in  general,  only  been  answered  in  the 
literature  under  certain  convexity  conditions  [G3,L1,L4]. 
In  this  section  we  take  advantage  of  the  structure  of  a 
large  allocation  problem  to  give  a far  more  general  result. 
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[D3.3.1]  De  f in  it  ion : We  define  the  R-d  imensional  vector  A, 
whose  kth  component  represents  the  largest  change  in  the 
usage  of  the  k resource,  that  can  be  caused  by  a single 
activity  A1  (with  its  decisions  restricted  to  its  strategy 
set  S1 ) : 


max  iaxifl  i iuk(xi)-uk(x2)i  [] 

i xjes  .xjes  K 1 K * 


[ T 3 • 3.2]  Theorem  3-1: 

If  there  exists  an  ! S S+  with  J(X)=J*  and  u(jO£  Z-aA,  where 
3=(R-1)/2,  then  there  exists  a X*  >0  and  an  2_(X»)  as  in 
[3.2.12]  such  that  J(X(X*))=J*  and  u(^(X*))£  5,  that  is, 
5UX*)  solves  (AP).  [] 

For  a large  problem  with  (say)  several  thousand 
activities  using  each  resource,  we  would  expect  aA  to  be 
very  smal 1 in  comparison  with  2.  In  that  case  we  can  give 
the  following 


Inter pr etation  o f Theorem  3 -I : 

If,  for  a slightly  tighter  set  of  limits,  the  original 
problem  is  still  feasible,  then  there  will  exist  a X*>0  such 
that  the  (decentralized)  solution  ^(X*)  will  also  be 
feasible  for  the  original  problem.  (This  in terpretation 
will  be  illustrated  by  a diagram  in  sec. 3. 5.4)  [] 
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Remarks  : The  importance  of  our  theorem  is  threefold  -- 

1.  We  have  given  conditions  under  which  there  will  be 
no  duality  gap  in  the  Artificial  Problem. 

2.  Our  conditions  require  no  convexity  and/or 

continuity  and/or  linearity  assumptions.  In 
comparison  to  existing  conditions  they  are 

extremely  mild,  and  likely  to  be  true  for  most 
large  systems  (since  aA  is  very  small  in  comparison 
with  5,  as  explained  above). 

3.  If  each  S1  has  N components,  our  theorem  justifies 
replacing  the  IxN  Integer  Programming  problem  for  x 
by -the  R-dimensional  Nonlinear  Programming  problem 
of  finding  a suitable  A.  For  instance,  in  the  case 
of  the  FIAT  warehouse  problem  (Chapter  2),  IxN  has 
approximately  one  million  components,  whereas  R has 
only  24. 


3.3.1  Outline  Of  Proof  Of  Theorem 

Since  the  proof  of  the  above  Theorem  is  quite  long,  we 
give  the  reader  a flavour  for  what  needs  to  be  accomplished. 
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3.3. 1.1  Motivation  - Let  us  consider  the  R+1  dimensional 
space  of  payoff  versus  resource  usage  for  (AP).  In  this 
space  consider  the  set 


V 

a. 


j = { ! J =J  ( JC ) , u=u(x),  for  x_  e S+  } 

The  problem  [3.2.12]  finds  a solution  + such  that 
J ( x ) -A ' [ u( X) -5 ] > J ( x) -X ' [ u( x ) -5  ] for  all  x€S 


Tti) 

u.(?) 


or,  with  (.-J  = L 


[1 ,A< ][*!  > Cl  ,X'  ] 


we  can  write  this  as 


r A t v 
J > 

A 

I K 


for  all 


V 

a 


e 3 


which  is  a support  hyperplane  for  the  set  passing  through 


the  point  L*  6 j.  Thus,  solutions  found  using  [3.2.12]  can 
only  lie  on  these  support  planes. 


The  source  of  duality  gaps  is  that  (if  3 is  nonconvex, 
say)  there  may  exist  solution  points  on  the  nonconvex 
boundary  of  3 which  are  not  touched  by  any  support  plane  of 

3. 


Our  approach  is  to  show  that,  although  such 
"unreachable"  points  may  exist,  the  aA  condition  in  the 
Theorem  assures  us  that  there  exists  some  other  "reachable" 
point  which  is  also  an  optimum  for  (AP).  We  do  this  as 
follows  . 


First  we  look  at  the  projection  of  a subset  of  3 onto 
the  u-space.  Specifically,  we  look  at  the  set  U*  of  all  u 


such  that 


with  J=Jfi.  mis  will  be  illustrated  by  an 


example . 
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3-3-  1-2  Simple  Allocation  Example  - 


Let  1=3  (number  of  activities) 
and  R=2  (number  of  resources) 

Suppose  the  activities  are: 


A1 

with 

x 1 es 1 = 

[a,b] 

and 

u 1 ( a) 
u (b) 

= 

[1,4]' 

= 

[3,3]' 

A2 

wi  th 

x2es2  s 

{ a , b } 

and 

u?(a) 

- 

[3,5]’ 

u ( b) 

- 

[4,3]’ 

A3 

with 

x3es3  = 

{ a ,b  ,c  } 

and 

u^  ( a) 

- 

[4,4]’ 

u^(b) 

Z 

[5.5, 3 

u3(c) 

Z 

[6,2]' 

The  augmented  strategy  set  S^_,  for  example,  would  be 
S^={a,b,0}  with  u 1 (0 ) = [0 , 0 ] ' . 

The  possible  values  of  u(£)=}—  u^(x^),  for  x.es , are 

1 = 1 f W 

shown  in  Table  3-1.  For  example,  in  the  4 row  of  the 
table,  for  x=[a,b,a]'  we  have 

u(x)  = u1 (a)+u2(b)+u3(a)  = [9,11]'  . 

Since,  for  J ( jc ) = J * we  must  have  x^S , the  set  U*  is  precisely 
the  set  of  values  in  the  Table,  and  this  set  is  illustrated 
in  Fig . 3-1 . 

By  examining  the  values  of  u1(x1)  above,  we  find  (see 
definition  [D3- 3- 1 ] ) 

A = [2,2]'  . 

Since  R =2 , we  have  a = (2  — 1 )/2  = 0.5  so  that 

a 4 = [1,1]'  . 

We  shall  need  this  value  in  later  illustrations.  [] 
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Fig . 3-1  : The  set  U*  for  the  Example. 

The  set  consists  of  all  possible  values  of  u(jO  , shown  by 
heavy  dots.  The  line  H is  a typical  support  hyperplane  for 
the  set  U»,  passing  through  the  point  Q. 
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With  U*  as  defined  above,  our  proof  shows  that  the  aA 
condition  in  Theorem  3-1  guarantees  the  existence  of  some 
u*6U»  which  lies  on  the  "reachable"  boundary  of  U*,  and  also 
has  u*£  5.  Then  we  proceed  to  the  (J,u)  space,  where  we 
construct  a suitable  A*  such  that  the  correspond ing 
hyperplane  (as  above)  passes  through  (J*,u*),  is  a support 
for  3 , and  also  satisfies  A*>0.  This  proves  the  Theorem. 
We  shall  illustrate  these  points,  using  the  example,  as  we 
proceed  with  the  proof. 


3.3. 1.3  Stages  In  Proof  - Our  proof  depends  heavily  on  the 
properties  of  polyhedra,  and  these  are  reviewed  in  the  next 
section.  Then  we  define  a set  W,  related  to  U*  above,  and 
define  its  convex  hull  (?.  Sec. 3.  5.2  characterizes  the 
support  planes  of  W.  Sec. 3. 5. 3 characterizes  certain  faces 
of  and  shows  why  we  need  only  consider  such  faces. 
Sec. 3. 5. 4 accomplishes  the  existence  of  a u*  as  described 
above.  Finally  Sec. 3. 5. 5 carries  out  the  construction  of  a 


suitable  A*. 
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3.4  CONCEPTS  RELATING  TO  POLYHEDRA  --  A BRIEF  REVIEW 

Our  proof  relies  heavily  on  the  properties  of 
Polyhedra.  Hence  a review  of  relevent  concepts  is  given 
here.  The  reader  should  be  aware  that  several  different 
(but  equivalent)  definitions  can  be  given.  Our  exposition 
is  taken  mainly  from  [SI]. 

We  shall  be  concerned  with  n-dimensional  Euclidean 
spaces  En . For  U^En,  ^SU,  and  a1  scalars,  we  define  the 
conical  hull  of  U 

N . . 

cone(U)  = [^  a1^1  ' al>0}  for  any  finite  N, 
iTi 

the  convex  hull  of  U 

N . . N . 

convex(U)  = l /^"a1^1  ! a1^),)- aL  = 1}  for  any  finite  N, 
iTi  iTi 

and  the  affine  hull  of  U 

N N . 

affine(U)  = { }T~ a1^1  ! J~ a1:!}  for  any  finite  N. 
iTi  iTi 

The  sum  of  two  sets  U,VC£n  is 

U+V  = {u+v  | u6U  and  v eV } . 

[D3.4.1]  De  f in  it  ion : The  solution  set  PcEn  of  a finite 
system  of  linear  inequalities 

P = { c ! hl  'c  < b1 , h1 ,c  6En , i = 1 , . . . , N) 
is  called  a convex  polyhedron  (or  briefly, 
polyhedron  ) . [ ] 
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[P3*^*2]  Property : If  U and  V are  finite  point  sets  in  En 
then 

P = convex(U)  + cone(V) 
is  a pol yhedron  . [ ] 

In  the  following  statements,  P will  always  denote  a 
polyhedron. 

[D3-4.3]  Def in ition : In  [D3.4.1],  for  each  i the  set 

{ c | h* ' c = b1  } 
is  called  a bound ar y plane  of  P.  [] 

[D3- 4.4]  De  f in i t ion : The  intersection  of  P with  some  of  its 

boundary  planes  is  called  a fac e . (A  face  is  itself  a 

polyhedron.  Also,  if  FI  is  a face  of  P and  F2  a face  of  FI, 

then  F2  is  a face  of  P. ) A face  consisting  of  exactly  one 

point  is  called  a vertex . A face  which  is  different  from  P 
is  called  a pr oper  face . [] 

C D 3 - ^ • 5 ] Def in ition  : c is  an  ex  tr erne  point  of  P if 
c 6 convexl  c1  ,c“ , . . . ,c_^  } for  c*6  P 
implies  c1=c  for  all  i.  [] 

CP3- 6]  Property : Every  extreme  point  of  P is  a vertex, 
and  conversely.  [ ] 

[D3«*J.7]  De  f in  it  ion  : P is  called  bounded  if  there  exists  a 
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scalar  e such  that  for  all  c6P,  |c^|<e.  [] 

[ P 3 - ^ - 8 ] Property  : Every  bounded  polyhedron  is  the  convex 
hull  of  its  finitely  many  extreme  points  (vertices).  [] 

[P3- 4. 9J  Property : P is  bounded  if  and  only  if  it  contains 
no  hal fl ine  . [ ] 

[D3.*4.10]  Definition  : Any  point  belonging  to  a proper  face 
of  P is  a bound ar y point.  All  other  points  of  P are  inner 
points . [ ] 

[D3-4.11]  Definition  : A boundary  plane  containing  the 

entire  polyhedron  is  called  singular.  (For  example,  if  the 
face  F is  the  intersection  of  P with  the  boundary  plane  H , 
then  w.r.t.  the  polyhedron  F,  the  plane  H is  a singular 
boundary  plane.)  [] 

[P3.^.  12]  Property:  If  c is  an  inner  point  of  P,  and  c"  an 
arbitrary  point  on  the  intersection  of  all  singular  boundary 
planes  of  P,  then  tnere  exists  an  e>0  such  that 

c + e ( £-£" ) and  c- e ( c-c"  ) 

are  both  in  P.  (If  there  are  no  singular  boundary  planes, 
the  above  intersection  is  En  by  convention.)  [] 

[P3-4.  131  Property : (This  is  true  of  convex  sets  in 

general.)  Let  UCEn  3nd  put  dsdimension  of  affine(U).  Then 
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for  each  point  u S convex(U)  there  exist  d + 1 points  u 1 eu 

1 d 1 

such  that  u e convexlu  ,...,u  }.  [] 


3.5  PROOF  OF  THEOREM 
3. 5. 1 The  Sets  W And  W 

D 

Consider  the  R-d imensional  space  E in  which  the  vector 
u(x.)  lies,  and  consider  any  constraints  vector  c in  this 
space.  Following  [L4]  we  define  the  Primal  Functional 
G(£)  = max  J (£)  subject  to  u(j<)£  c 

xes 

— + 

D 

and  define  WC  E as 

W = { c ! c >0  and  G(c)=J*} 

so  that  W is  the  set  of  all  constraints  for  which  a feasible 
allocation  exists  for  the  original  problem.  Also  let 

t?  = convex(W)  . 

See  Fig. 3-2  for  an  illustration  of  W and  8. 

[D3.5.1]  Definition : We  shall  say  S doin inates  c if  6 < c, 

D 

and  we  shall  say  c is  a non-dom inated  point  (NDP)  of  VCE 
if  2eV,  and  for  all  c6V  we  have 

c < a =>  c=a 


(that  is,  c is  not  dominated  by  any  other  point  of  V). 
Fig. 3-2  for  examples.  [] 


See 
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*/e  now  make  some  observations  on  W and  71,  for  later 

use  . 

[03-5.2]  26W  and  £ dominates  c =>  c6W. 

[03-5.3]  (xes+1  J(x)=J»)  <=>  (x6S,  u(x)ew). 

[03*5.4]  is  a polyhedron.  Proof: 

D 

Let  U = { u(£)  I £es  } , a finite  point  set  in  E 
and  V = { } , the  R unit  vectors  in  E^ . 

Then,  by  definition  of  W, 

cew  =>  c £ u (for  some  ueu ) 

R 

rl  i 1 

a e (for  some  a £0) 
i=l 

ana  conversely,  so  that 

W = U + cone(V)  . 

Now  = convex  (W) 

= conv  ex  ( 1)  +cone  ( V ) ) = convex(U)  + cone  (V) 
where  the  equivalence  in  the  final  step  is  easily 
established.  Hence  by  [P3.4.2]  J is  3 polyhedron. 

[03.5.5]  2efl  and  2 dominates  c =>  c€3  (compare  [03.5.2]). 
Proof:  This  follows  from  the  final  equation  of  [03.5.4]. 
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3.5.2  On  Representing  The  Boundary  Planes  Of  W 


CL3-5-6]  Lemma : Let  H be  a boundary  plane  of  W,  which 

passes  through  c*6W.  Suppose  H is  represented  as 

[3-5.7]  { 0 ! h'c=h’c*  } 

the  sign  of  h being  chosen  such  that 

[3.5.8]  _h'c*  1 h'£  for  all  ceW  • 

Then  h >0  (componentwise).  [] 


Proof : The  proof  is  by 

components  of  h are  negative. 
Consider  2 such  that 


contradiction.  Suppose  some 
Let  h.  be  one  such  component. 


2 .> c*  and  2.  = c*  ( V.i  j ) . 
j j k K 


[3.5.9]  h'2  = h'(2-c*)  + h'c* 

= hj(5j-c^)  -t-  h'c* 

< h'c*  . 

But  clearly  2 £ c*  so  that  2SW  (using  [03.5.5])  and  thus 
from  [3.5.8]  h'c*  < h'2,  which  contradicts  [3.5.9].  [] 


[C3-5.10]  Corollary:  With  definitions  as  in  the  previous 
Lemma,  if  £*  is  dominated  by  25W,  then  20H.  [] 


Proof : 2&W  =>  h ' c*£  h’2  using  [3.5.8]. 

h >0  and  2 < c*  =>  h'2  < n'c*.  Hence  h'2=h'c*.  [] 


-54- 


3.5.3  Characterization  Of  Non-dominated  Faces 

[D3.5.11]  De f in i t ion : A proper  face  F of  3 such  that  some 
inner  point*  of  F ' s an  NDP  of  R,  is  called  a non-dominated 
face  of  R.  The  set  of  all  non-dominated  faces  of  R will  be 
denoted  PI.  (See  Fig. 3-3)  [] 


The  justification  for  naming  an  entire  face,  on  the 
basis  of  one  point,  follows  from  the  next  Lemma. 

[ L 3 . 5 . 12]  Lemma : Let  PJ  be  as  above,  and  let  FePT.  Then 

a.  PI  includes  all  NDPs  of  R. 

b.  All  points  of  F are  PJDPs  of  R. 
c . F i s bounded  . 

d.  F is  the  convex  hull  of  some  extreme  points  of  W. 

F F 

e.  If  c 6 Ff>W,  there  exists  xSS  with  u(x)=c  . 

[] 

Note  that  (a)  and  (b)  above  show  that  the  set  of  all 
points  which  are  NDPs  of  R is  equivalent  to  the  set  of  all 
points  on  faces  in  PI. 


•Note  that  from  the  definition  of  inner  points  [03.4.10],  a 
vertex  is  an  inner  point  of  itself. 
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Proof : 

a.  Suppose  £ is  an  NDP  of  W.  Now  c cannot  be  an  inner 

point  of  ft,  since  it  would  be  possible  to  find  another 
point  in  3 which  would  dominate  it  (see  [P3.4.12]). 
Thus  £ must  lie  on  a proper  face,  say  F'.  If  £ is  an 
inner  point  of  F',  then  F'SN  and  £ is  included  in  N.  If 
£ is  a boundary  point  of  F'  then  it  must  lie  on  a proper 
face  F"  of  F'.  Eventually,  since  ft  is  finite 

dimensional,  £ will  either  be  an  inner  point  of  some 
face,  or  will  lie  on  a vertex,  wnich  is  an  inner  point 
of  itsel f . [ ] 

N 

b.  Let  £ be  an  NDP  of  and  an  inner  point  of  F.  Now 
suppose  that  £^6F  is  dominated  by  some  . For  any 
singular  boundary  plane  of  F we  have** 

h'£^=h'£^£  h'£  (with  h as  in  [L3.5.6]). 

But  also  h £0  [L3.5.6],  and  £ < c^  with  strict 

inequality  for  at  least  one  component,  say  *^<0^, 

implies  h' (£D-£)=0  and  hj=0.  Thus  if  is  the  unit 

vector  in  the  j direction,  h'e^=0.  This  means  that 
N i 

£ +e  lies  on  the  intersection  of  all  singular  boundary 

planes  of  F so  that  [P3.4.12]  there  exists  an  a>0  such 

N i N 

that  £ -ae  S F.  But  this  point  dominates  £ , wnich  is 

a contradiction.  Thus  cD  must  also  be  an 


**We  use  this  phrase  frequently,  to  denote  "a  boundary  plane 
of  W which  is  a singular  boundary  plane  of  F 
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NDP  of  ft.  [ ] 

c.  F is  a convex  polyhedron  [D3.4.4].  If  it  is  not 

C 

bounded,  ihen  [ P 3 • ^ . 9 J there  must  exist  c 6F  and  c such 
that 

c^+bc  e F for  all  b>0. 

p 

In  addition,  we  must  h3ve  some  c.<0,  or  else  c +bc  will 

F 

be  dominated  by  c (which  cannot  be,  by  (b)  above). 

Choose  b large  enough  so  that 

c^  + be.  <0 
l l 

which  is  a contradiction,  since  W (and  hence  ft)  consist 
of  vectors  with  non-negative  components.  [] 

d.  Since  F is  bounded,  it  must  be  the  convex  hull  of  its 

extreme  points  [P3.4.8J.  These  points,  being  faces  of 

F,  are  also  faces  of  ft  [D3.4.4]  hence  extreme  points  of 

ft.  Let  £ be  such  a point.  By  definition  of  ft 

12  N 

£ = convex  [c  ,c  , ...  ,c  } for  c^W. 

This  implies  that  [D3.4.5]  £=£1  for  all  i,  so  that 

c ew . [ ] 

F F 

e.  Since  £ ew , there  exists  £es+  with  u(£)<  £ and  J(x)=J*. 

F F 

But  £ is  an  NDP  of  ft,  and  j(x)Wc  3,  implies  £(£)=£  . 

Also  J(x)=J»  =>  xes.  [] 
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3.5.4  From  Premises  Of  Theorem  To  Allocation  On  F 

[L3.5.13]  Lemma : (Delta- Domination  Lemma ) 

F f 

For  any  0 6 F 6 17,  c +aA  is  dominated  by  some  c*6  WfAF, 
where  a and  A are  as  in  Theorem  3-1.  (See  Fig. 3-4)  [] 

Proof : This  is  quite  involved  and  is  left  to  the  Appendix. 

[L3.5.74]  Lemma:  If  there  exists  SOS  such  that  J(X)=J*  and 
— + — 

u(X)£  £-aA,  then  there  exists  x*es  such  that 
a . u( x* ) < Z 

b.  u(£*)  & FO  W,  for  some  FeiT. 

[] 

Proof : (See  Fig. 3-5)  Since  J(x)=J*,  £-aA  6 W.  Now  choose 

p 

some  £ SF  (some  F 6 ft ) , which  dominates  £-§A.  (The  reader 
can  verify  that  such  a choice  is  always  possible.  In 
particular  if  2-aA  is  itself  an  NDP , we  can  just  choose 

p 

£ =2-aA.)  Then  from  [L3.5.  123,  there  exists  a £*6  FH  W such 
that 

£*  £ £^  v a A 

p 

but  also  £ < 2-aA 

so  that  £*<2.  But  now  since  c * e FDW,  from  [ L 3 . 5 . 1 2e  ] 
there  exists  some  x*es  with  u(x*)=£*. 

[] 
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3.5.5  Construction  Of  Suitable  X 


[L3.5.15]  Lemma : If  there  exists  ^ as  in  the  preceding 
Lemma,  then  there  exists  a A *>0 , and  an  ^(A*)  as  in 
[ 3 • 2.  12  ] , such  that 

J(X(A*))  = J»,  and  u(x(A»))<  5 [] 


Proof : Let  £*  and  £*  be  as  in  the  previous  Lemma,  and  let 

H,h  be  as  in  [L3-5.6].  Now  consider  the  R + 1 dimensional 
space  of  Payoff  versus  resource  Usage,  with  points  t such 
that  J=J  (x)  , u = u(  x.)  , for  some  x.es  + . Define  the  sets 


J<  = { 


! J < J * } , and  j_  = { 


! J <J  * } 


Since  the  allocations  are  discrete  points,  has 


finite 


l]  e 3<  1 


number  of  points  and  there  exists  a J'  such  that 

J ' = max  { J ! 

Choose  b>h' c*/(  J ’*-J  ' ) . We  will  now  show  that 

X*  = (1/b)h 

is  a suitable  value  for  X. 


First,  we  note  that  (see  [L3-5.6]  and  definition  of  J') 


A *>0 . 


M «, 


Next,  consider  any  /“  | ^ . Then 

b > h'£*/(J*-J)  since  J<J  ' 

> ( h ' £*-h ' u) /(  J *-J  ) since  h >0,  u >0 
=>  J*-J  > (h'c*-h'u)/b 


that  is, 
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[3.5.  16]  J*-X*’c»  > J-A*' u . 


And 

J* 


now,  consider 


- h’c*/b  = J 
> J 


- h'c*/b 

- h 1 u/  b 


6j_  . Then 
since 
( tj  ew , 


j=j* 

and  using 


CL  3- 5.6]) 


so  that 

[3.5. 17]  J*-A*'c»  > J -X* ' u . 


Finally,  we  consider  that  x.(A*)  maximizes 
over  £es+.  From  [3.5.15]  and  [3-5.16]  we  see 
(_x * as  in  CL4.5.  1^])  satisfies  this  criterion, 
from  [L3.5.14]  it  follows  that  £*<  5,  so 
u(  £*  ) =£*  <_  £,  and  J(£*)=J*.  We  have  thus  shown 
of  a suitable  X*  and  2(A*).  [] 


J(x)-X*’u( x) 
that  ^(X*)=£* 
Fur  thermor e , 
that  we  have 
the  existence 


This  also  concludes  the  proof  of  Theorem  3-1.  [] 


3.6  EXISTENCE  OF  STRICTLY  POSITIVE  A 

Under  the  stated  conditions,  Theorem  3-1  showed  the 
existence  of  a suitable  A X).  From  the  proof  of  [L3-5.15] 
we  see  that,  since  we  could  choose  any  support  plane  for  W 
through  £* , and  a range  of  values  of  b,  the  A is  not  unique 
— there  is  a range  of  possible  values.  The  next  Theorem 
shows  that  in  this  range  of  values  there  also  lies  a A which 
is  strictly  positive.  (This  result  will  be  useful  m 
Chapter  4,  for  our  iteration  algorithm.) 
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[T3.6.1]  Theorem  3 -I  I : 

Under  the  conditions  of  Theorem  3-1,  there  also  exists  a 
X*>0  such  that  2(X*)  solves  (AP).  [] 

Proof:  Let  c*eF,  for  some  F6N,  be  as  in  the  proof  of 
[L3-5.14].  Since  F is  a face  of  W,  from  [D3.4.4]  it  can  be 
written 

[3.6. 2]  F = 3nH1nH2n...nHN 

where  the  H1  are  boundary  planes  of  ft.  Since  £*  lies  on 
each  H1 , we  can  write  each  H1  in  the  form 

{ c ! * 

and  by  choosing  the  sign  of  each  h1  such  that 

[3.6.3]  h1'^  < h1,£  for  ail  £6^ , 
we  shall  have  n1>3  (from  [L3-5.6]).  Now  let 


i=  1 


Then  n'c*  <_  £'c  for  all  c e S from  [3.6.3]. 

From  above  we  know  that  £ £0.  We  will  now  Drove,  by 
contradiction,  that  £>0  (strict).  Suppose  n^rO.  This 
implies  h*=0  for  all  i.  Let  2:c*+ae^ , where  e^  is  the  unit 
vector  in  tne  j direction,  and  a>0.  Then  S is  dominated  by 
c*  so  that,  from  [03-5.5],  c eW . Also,  for  all  i=1,...,N 
h^'S  = h^'c*  + ah^'e1-'  - h^'£* 

so  that  ceH1  . Hence  by  [3.6.2],  6 6F . But  c is  dominated  by 
c* , and  this  contradicts  the  property  [L4.5.l2b]  of 
non-dom inated  faces.  Thus  n^>0. 
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The  remainder  of  the  proof  follows  [L3.5.15] 
exactly.  [ ] 

[C3.6.4]  Corollary : If  there  exists  a A >0  such  that  x(A) 
solves  (AP),  then  there  exists  a A*>0  (strictly)  such  that 
A * ) =_x  ( A ) , so  that  x_(A*)  solves  (AP).  [] 

Proof : In  the  proof  of  Theorem  3 —I I above,  simply  take 
c»=u(x(A)).  [] 

[R3-6.5]  Remark : The  aA  condition  is  not  required  for  this 
corol 1 ary . [ ] 

3.7  DISCUSSION 

Everett  [El]  pointed  out  that,  even  under  general 
conditions  on  the  ux(.)  and  S1 , some  statements  could  be 
made  regarding  the  properties  of  an  allocation  SKA),  for  any 
A >0.  He  did  not,  however,  deal  with  the  existence  of 
optimal  multipliers.  Our  results  greatly  extend  the 
applicability  of  the  Lagrange  Multiplier  technique,  by 
giving  simple  conditions  for  the  existence  of  optimal 
multipliers,  under  general  conditions  on  the  u1 ( . ) and  S1  . 
We  also  emphasize  that  in  practice,  our  conditions  are 
likely  to  hold  for  large  systems.  This  justifies  the  use  of 
the  decentralized  method  of  finding  feasible  solutions  to 
large  allocation  problems. 
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APPENDIX  3A 

THE  DELTA-DOMINATION  LEMMA 

For  the  convenience  of  the  reader,  tne  Del ta-Dom  ina t ion 
Lemma  [L3-5.13]  is  re-stated  here: 

F F 

For  any  c 6 F 6 !7 , c +aA  is  dominated  by  some  c*6  Wfi  F. 
(Here  a and  A are  as  in  Theorem  3-1.)  [] 

Outline  of  Proof  of  the  Lemma 


The  proof  proceeds  in  five  stages: 

1.  We  characterize  the  set  of  all  points  in  W/OF,  in 
terms  of  the  resource  usage  vectors  of  individual 
activities. 

2.  We  represent  these  individual  usage  vectors  as 
basis  vectors  in  a different  space,  and  show  how  a 
certain  polyhedron,  P,  represents  F in  this  space. 

3.  We  show  that  a certain  subset  of  the  faces  of  ? 


maps  onto  all  of  F. 


4.  We  derive  a simple  representation  of  any  such  face 
of  P. 

F 

5.  We  take  a point  £ on  F,  and  choose  a certain  point 
on  a suitable  face  of  P.  This  point  will  map  onto 
c*e  wn  F such  that  c*<  c^+aA. 


3A.  1 SET  OF  POINTS  IN  FAW 


Let  Cj  be  a point  of  FAW.  (By  [L 3- 5.1 2d]  FAW  cannot 
be  vacuous.)  Then,  by  [L3-5.12e]  there  exists  an  allocation 
with  £(Xg)  = Cj.  In  this  allocation,  let 


u^j  = usage  vector  of  activity  i 


so  that 


c . = u1  . 

-J  “J 


i=  1 

Now,  for  all  points  c^S  WHF,  let  u^j  be  defined  as  above, 
and  define  the  set 

U1  = LJ  ( u1}  . 

J 

Loosely  speaking,  U is  the  ’’set  of  usage  vectors  of  the  i 
Activity,  for  points  lying  on  FA  J"  . 


Our  first  step  consists  of  showing  that 


all 


combinations  of  these  _u^  according  to  the  manner 


C3A.  1. 1 ] 


I 

s = r ii 

is  1 


for 


- l e i 

u 6 !J 


will  generate  points  c also  lying  in  FAW.  In  other  words, 
we  show  that  the  set  of  all  combinations  c in  [3A.1.1],  is 
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exactly  the  set  of  all  points  of  FA  V. 


Suppose  c^  and  are  both  in  FHW.  Let 


I 


c s r u1 

~ h / — f 

i=  1 


for  u1  6 U1 


c £ r u1 

— o — o 

i=  1 


for  u1  S 'J1 

— o 


and  let  H be  any  singular  boundary  plane  of  F (see 
[D3.4.11])  with  equation 

[ 3A . 1.2]  h'c+  < h'c  for  all  c 6 3 

so  that,  by  definition, 


[ 3A . 1.33 


h’c  = h ' c 
— — (-  — — o 


We  will  show  that  h' (u^ -u^ ) = 0 for  all  i.  Suppose  that 
h'(u^-u_^)<0  for  some  k ( 1 _<  k <_  I).  Consider  the  allocation 


i i ; 

c = ) u where  < 

iTi  v 


(i^k) 

k k 

u - u 
- -o 


Since  c 6 W,  we  must  have 


h'c  < h'c 

— — + — — — 

, , , k k v 

— — +•  — o — + 

C h'c  + 

which  is  a contradiction.  Thus  we  have  shown  that 

for  all  i . 

But  also,  from  [3A.1.3], 


h' (u^-u1)  > 0 
— — o — + — 


h'  (u1-u1)  = 0 . 

— — o — + 

i=  1 


Hence 
L3A.  1 . 4] 

Clearly,  by  definition  of  U1 , and  the  generality  of  c 


h'(u^-u^)  = 0 for  all  i. 


above,  we  can  generalize  [3A.1.4]  to 
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h'Cu^-u1)  = 0 for  all  u^u1  6 U1  ; for  all  i. 

— — o — + — o — ► ’ 

It  is  trivial  to  show  now,  that  for  any  5 as  in  [3A.1.1],  we 
will  have 


h ' c = h ' c 

— Q 


( c as  in  C 3 A . 1 . 3 3) 


so  that  £ also  lies  on  H.  Also  £ e W.  Hence  £ e Hf^W. 

Since  H is  any  boundary  plane  containing  F,  it  follows 
that  £ lies  on  the  intersection  of  all  such  planes  with  W . 
But  this  is  precisely  the  definition  of  F [D3.4.4].  Thus 
c 6 Ff|  W. 


3A.2  REPRESENTATION  OF  F IN  A HIGHER-DIMENSIONAL  SPACE 


In  the  following,  for  ease  of  notation,  we  shall  use 

doubly-indexed  unit  vectors  £ . , by  which  we  mean 

0 


i 

-J 

k 

= V 

— m 

< = > 

i=k, 

j = m 

and  v 1 

. k 
i v 

II 

\/ 

v1  ! 

k 

V 

-J 

— m 

-J  - 

— m 

Now  let  the  N1  elements  of  U1  be  ordered  in  some  fashion, 
say 

■ ■I  r i i i , 

U = fji 

We  shall,  as  shown  below,  associate  these  elements  with  unit 
vectors  in  the  set 

yl  £ { v 1 v 1 v ^ • } 

- — 1 2’ * " * — N1  * 
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Let  V 


I i 

O V1  and  N 
i = 1 


I 

r n1 


dimension  of  span(V). 


Define  the  1 inear  map  T(.)  from  span(V)  to 

span ( U ^ +U . . . +U ^ ) by 

t(4>  8 4 • 

Since  the  v^  are  basis  vectors  for  span(V),  the  above 
equation  defines  T(.)  for  all  points  in  span(V)  [N1]. 


We  now  consider  those  points  in  span(V)  which  can  be 
written 

I . 

[3A.2.1]  v = y v1  for  v1  6 V1  . 

iTi 

Clearly,  T(j/)  = 6,  for  some  6 as  in  [3A.1.1],  so  that  these 
points  correspond  one-to-one  with  all  points  in  F^W.  Let 
P°  be  the  set  of  all  such  v_,  and  P its  convex  hull  (so  that 
by  [P3.4.2]  P is  a Polyhedron). 

Since  F is  the  convex  hull  of' its  extreme  points,  all 
of  which  lie  in  FO  W [ L 3 . 5 . 1 2 d ] , the  map  T transforms  points 
in  _P  to.  po ints  on  F.  (The  map  is  obviously  many-to-one,  in 
general  . ) 
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Remar k : P is  not  a hypercube,  in  general.  For  example,  in 
5 dimensions,  if  we  take 

V1  = {[1 ,0,0, 0,0]' , [0, 1,0, 0,0]’ , [0,0,0, 1,0]’ } 

V2  = {[0,0,  1,0,0]’ , [0,0, 0,0,  1]'  } 
then  the  3-D  projection  of  the  object  P is  the  pr 13m  in 
Fig.3A-1.  (We  are  projecting  both  [0,0,0, 1,0]'  and 
[0, 0, 0, 0,  1 ] ’ into  [0,0,0]'.) 


Fig  . 3A-1  : The  Object  P 
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3A.3  ONLY  NEED  TO  CONSIDER  CERTAIN  FACES  OF  P 

Let  n=dimension  of  F.  In  this  step  we  show  that  the 
map  of  just  the  r-dimensional  faces  of  P (all  r£n) , is 
sufficient  to  cover  F.  This  will  mean  that,  in  later  steps, 
we  need  only  consider  such  faces.  Formally,  we  have: 

C 3A . 3 • 1 3 Lemma  : Let  n = dimension  of  F.  For  any  point  c 6 F, 
there  exists  a point  v^  6 G (an  r-dimensiona I face  of  P,  r£n) 
such  that  c = T(v).  [] 

Proof : (Reminder:  P is  N-dimensional  , F is  n-dimensional . ) 

If  N£n , there  is  nothing  to  be  shown . In  practice  of  course 

we  expect  N>>n.  Now  if  N>n,  let  Y be  the  N-n  dimensional 

null-space  of  T,  and  let  c=T(w)  for  some  W 6 P.  The  linear 

variety  w+Y  can  be  written  as  the  intersection  of  n 

Hyperplanes  in  the  space  of  span(V): 

[3A.3.2]  w+Y  = n H.  . 

i = 1 1 

Consider  (w+Y)OP.  Suppose  this  intersection  is  contained 
entirely  within  a face  of  P,  defined  by  the  intersection  of 

s boundary  planes 

, , n+s 

C3A. 3- 33  . rA , H. 

i = n+ 1 1 

with  P.  (This  statement  is  quite  general,  since  if  s=0  then 
this  intersection  is,  by  convention,  the  whole  space,  and 
the  face  is  just  P itself.) 
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Combining  [3A.3.2]  and  [ 3 A . 3 - 3 3 we  have 

n + s 

(w+Y)r\P  d n H.  . 

i=1  1 

Since  not  all  of  the  s boundary  planes  are  necessarily 
different  from  the  first  n planes,  we  can  rewrite  the  above 
as 

r + s 

C 3 A . 3 - ^ J (w+Y)r\Pc  r\  H.  for  some  r<n  . 

~ i = 1 1 - 

Now  (w+Y)  is  itself  a polyhedron.  Consider  a vertex  of 

this  polyhedron,  which  by  [ 3A.3.4]  must  be  obtained  by  the 
intersection  with  N-(r+s)  more  boundary  planes  of  P,  none  of 
which  is  the  same  as  any  if  in  C 3 A . 3 - ^3  above.  Now  v^  lies 
on  these  N-(r+s)  boundary  planes  of  P,  as  well  as  on  the  s 
boundary  planes  of  P in  C3A. 3. 3],  so  that  it  lies  on  the 
intersection  of  N- ( r+s) +s  = N-r  different  boundary  planes 
with  P.  Thus  v^  lies  on  an  r-dimensional  face  of  P.  But 
also  v_  lies  in  w+Y,  so  that  T(v)  = c as  desired.  [] 

3A.4  REPRESENTATION  OF  r-DIMENSIONAL  FACES  OF  P 


Let  G be  an  r ( £n  ) dimensional  face  of  P,  as  in  the 
previous  section.  Below,  we  derive  a simple  repr esentat ion 
for  any  point  on  G.  Let 

[ 3 A . 4 . I ] v = y~  v*  for  v^ev1  (similar  to  [3A.2.1]) 

i=  1 


and  let  ZK^,...,Zkk  be  such  that 

Zkj  CL  VkJ 

with  vkJ  £ ZkJ 
— o 

ana  cardinality  of  Z J = r j >1, 


w i Lh 


= r . 
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Thus  in  general  , 

[3A.4.2]  t < r 

^ * 

since  some  Z J may  have  more  than  one  element. 

Then  the  set  of  all  points 

C3A.4.3]  v = v + TL(vkj-vkJ)  for  vkJ  6 ZkJU{vkJ} 

— + — O ' — + — o — + — o 

j=1 

is  the  set  of  extreme  points  of  an  r-dimensional  face  of  P. 
In  fact,  extreme  points  of  any  r-dimensional  face  of  P can 
be  thus  represented,  by  appropriate  choice  of  _vq  and  ZkJ  . 
(This  can  be  proved  in  a manner  exactly  paralleling  that  in 
sec . 3A . 1 ) . 

Now  all  points  on  G are  the  convex  hull  of  G's  extreme 
points.  Using  [P3.4.13],  for  any  point  v^  e G there  exist 
r+1  points  v j , v^ , . . . , v1^  e G with 

v™  = vq  + y~ (v^j’m-vkj)  (see  [ 3 A . 4 . 31) 
j— 1 

r+1  r+1 

such  that  v = J~ bmv™  for  some  bm>0 , bm=l 

m=  1 m=~1 

which  implies 

r+1  t . . 

C3A.4.4]  v = V + r bm  y~ ( vX j ’ m-vk j ) . 

m=  1 j=  1 

This  is  the  representat ion  that  we  desire. 
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3A.5  DERIVATION  OF  BOUND  aA 

p 

Let  c be  any  point  on  F.  From  [3A.3-1]  we  know  that 
cF=T(v)  where  v is  as  in  [3A.4.4],  Thus 


r>1  ^ 

cF  = T(v  ) + bm  T^T(vkj;rn-vJ<j)  using  [3A.4.4] 
m=  1 j=  1 

Define  u1  £ T(v*)  so  that  uj  6 U1 

Ukj£  T({vkj}UZkj)  = {u^j,ukJ, . . . ,u  |J.}  (say) 

+ — O — O —I  —"j*  J 

so  that  UkJ  UkJ.  Let  ukj;me  Uk J . 

+ — + 

Then  we  can  write 

F \ ■ i \ . m \ / k -i  j m k-i\ 

£=/H  + / b ) ( u J ’ -u  J) 

i~l  mTi  j=l 

I t rk J . , 

= rvrr 

is  1 j=1  p=0 

k,  k-  • 

where  g j>u  and  g J =1  for  each  j (j=1,...,t)  . 

P p~0 

For  each  k.  (j=1,...,t)  choose  the  constants  hkJ  as  follows: 
J P 


1 . 

For 

p=0 

i to 

rkJ, 

set 

. hkJ=0. 

P 

2. 

For 

p=1 

to 

r kj  , 

if 

gpJ>0.5, 

set  hkj=1. 

P 

3. 

If 

all 

hkJ 

p 

(p=1 

to 

rkJ)  are 

zero,  then  set  hgj  = 1 

[ 3 A .5-13  Remark : From  the 
for  each  k,,  the  above 

J 

1/  . 

(p=0,...,r  J)  and  the  rest 


properties 
procedure 
are  set  to 


k * 

of  j T we  see  that 
P 

^ . 

sets  exactly  one  h J=1 
P 


C 1 


zero . 
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I t J 

Define  c*  = T~  + T~  T“  hk J ( uk J-ukj ) . 

iTi  JTl  pTo  p "P  ~° 

From  [3A.5.1]  we  see  that 
I 

£*  = u1  for  some  £X  e U1  , 
iTi 

so  that  £*  e FA'yf. 

t r k J 

Now  c*-cF  = y~  (gkJ-hkj) (ukJ-ukj) 

j=i  P=o  p 

< 0.5 rL 

< 0. 5nA  ( see  C 3 A . 3.1]) 

< 0. 5(R-1  )A 

where  the  final  inequality  follows  from  the  fact  that  F is  a 
proper  face  of  the  R-dimensional  polyhedron  17.  Thus  we  have 

£*  < £F  + 0 . 5(R -1 )A  [ ] 


-76- 


CHAPTER  4 

THE  "SALA"  (STORAGE  ALLOCATION  ALGORITHM)  TECHNIQUE 

4.1  OVERVIEW  OF  CHAPTER 

In  Chapter  2 we  described  the  Resource  Management 
problem  in  a very  large  system,  and  showed  why  the 
decentralized  approach  may  be  a good  solution  technique  for 
this  problem.  In  order  for  this  approach  to  be  usable  we 
must  overcome  the  problems  described  in  sec. 2. 7 which  are 
briefly  (i)  possibility  of  duality  gaps,  (ii)  convergence  of 
algorithms  requires  stringent  conditions  on  the  functions, 
and  (iii)  slow  convergence  of  such  algorithms. 

With  respect  to  the  first  of  these  problems,  in  Chapter 
3 we  gave  simple  conditions  which  guar an  teed  the  existence 
of  optimal  multipliers  for  the  Artificial  problem,  under 
very  general  conditions  on  the  functions.  Resolving  the 
other  two  of  the  above  problems  then,  shall  be  the  main 
contribution  of  this  Chapter.  We  shall  develop  an  iteration 
algorithm  with  proveable  convergence  properties,  and  a 
quadratic  convergence  rate.  The  reader  should  notice  how 
(both  in  this  Chapter  and  in  Chapter  3)  we  are  able  to 
completely  remove  restrictions  on  the  form  of  the  individual 
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resource  usage  functions,  replacing  them  instead  by  some 
restrictions  on  their  magnitudes.  We  believe  that  this  is 
an  important  philosophy  for  analysis  of  large-scale  systems 
in  general,  since  the  former  requirements  (convexity/ 
continuity/  linearity)  are  seldom  met  in  practice,  whereas 
the  latter  are  easily  verified. 

A minor,  but  nevertheless  interesting,  contribution  of 
this  Chapter  is  the  Selection  Algorithm  (sec. 4. 4).  This  is 
a simple  and  intuitively  appealing  method  of  solving  certain 
inequality  problems,  yet  its  solution,  it  turns  out, 
possesses  useful  minimum  norm  properties. 


4.2  THE  SALA  APPROACH 

Our  approach,  called  SALA  (for  Storage  A1 location 
Algorithm,  a name  which  refers  to  its  original  objective  at 
the  warehouse)  will  be  as  follows:  We  begin  by  observing 
tnat  the  Individual  Problem  (IP)  [2.6.10]  can  be  made  still 
easier.  Then  we  look  for  an  iteration  scheme  to  find  a 
suitable  X*  (suitable  in  the  sense  of  sec. 2. 7).  To  do  this 
we  first  make  some  simplifying  assumptions.  The  resulting 
model  will  be  analyzed;  we  shall  propose  an  algorithm  and 
study  its  properties.  This  will  provide  us  with  insight  as 
to  how  to  extend  our  algorithm  to  the  more  realistic  case. 
Finally  we  shall  prove  convergence  results  for  this  latter 


case  . 
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4.2.1  Preview  Of  Iteration  Scheme 

We  remind  the  reader  of  our  objective  which  (as 
described  in  sec. 2. 7)  is  to  find  a X*  such  that  £(X*)  and 
u(X*)  are  optimal  for  the  Artificial  Problem*. 


Our  iteration  scheme  is  motivated  partly  by 
Arrow-Hurwicz  scheme  described  in  Chapter  2 (sec. 2. 7), 
partly  by  our  existence  theorem  (Chapter  3)-  We  choose 
starting  value  of  X,  say  X 0 , and  then  follow  the  scheme 

k-t- 1 k 

[4.2.1]  x = arg  min  L*(x,X  ) 


[4.2.2] 


v k+1  . k , . 

X = X + AX 


xex* 

k 


the 

and 

a 


until  we  find  an  optimal  X.  The  reader  should  compare  the 
above  scheme  with  [2.7.1]  and  [2.7.2].  Firstly,  our  scheme 
replaces  "max  L"  for  j(6X  by  "min  L*"  for  jxSX*,  where  (see 
next  section)  L*  and  X*  are  such  that  they  further  simplify 
the  solution  to  [4.2.1]  as  compared  with  [2.7.1].  Secondly, 
we  have  a different  method  of  updating  X,  using  the 
Selection  Algorithm  (see  sec. 4. 4),  which  will  lead  to  a 
quadratic  convergence  rate  of  the  scheme  [4.2.1]  and 


[4.2.2]. 


*The  Saddle  point  condition  was  stated  during  our  review  of 
known  theory.  From  the  discussion  at  the  end  of  sec. 3. 2. 3, 
and  the  Theorem  in  sec. 3. 3,  we  see  that  it  is  not  required 
for  our  solution. 
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4.2.2  "Min-Cost"  Allocation  For  Each  Item 

Let  us  consider  the  Individual  Problem  (IP)  [2.6.10] 
once  again.  It  is  easy  to  see  that  if  there  exists  a A such 
that  x.(A)  achieves  the  maximum  value  of  (AP)  [2.6.1],  then 
(IP)  can  be  replaced  by  the  following  Minimum  Cost 
Allocation  Problem* 

[4.2.3]  (MCA):  min  [ X ' u1 ( d 1 , x 1 ) ] 

[jc  i ! e'x^rQM 

This  says  that  for  a given  set  of  "costs"  X,  the  ioln  item 
must  find  that  allocation  (of  all  its  quantity  Q1 ) which 
minimizes  its  total  resource  usage  cost.  Thus,  assuming  the 
existence  of  an  optimal  X (see  next  section),  and  summing 
(MCA)  over  all  i,  we  have  replaced  [2.7.1]  by  [4.2.1], 
provided  we  define 
[4. 2.  4]  L»(x,A)  = A’u(x)  , 

and  X*  as  the  set  of  x such  that  each  it1  in  x^  satisfies  the 
equality  constraint  in  (MCA).  The  iteration  scheme  using  L* 
in  [4.2.1]  then  has  the  following  interpretation:  we  hold 

T 

the  objective  function  J(_x)  at  its  maximum  value  ( j1  ) 

i=  1 

and  try  to  bring  the  total  resource  usage  u(j<)  into  the 
feasible  region  ( < c)  . 


•Reminder:  uMdi,^!)  = vector  of  resource  usages  oy  item  i, 

whose  data  vector  is  d*,  when  its  allocation  is  . 

And  e = [1 , 1, . . . , 1]»  . 
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The  equality  constraint  in  (MCA),  and  the  fact  that  x1 
is  now  an  S-dimensional  vector,  make  (MCA)  a relatively 
simple  problem.  To  illustrate  this  we  note  that  in  the 
special  case  - tne  u‘(dx,.)  are  linear  functions  of  x^1  , 
the  solution  •_  y 1 is  simply  to  allocate  all  the  quantity 
to  one  storage  area. 


4.2.3  Existence  Of  Optimal  X 

The  conditions  for  the  existence  of  an  optimal  A were 
given  in  Theorem  3-1.  Our  proof  there  depended  on  the 
assumption  that  the  strategy  set  for  each  x.1  was  discrete. 
Let  us  simply  restrict  each  jc1  to  a discrete  set.  (This 
means  each  x1  may  assume  only  a finite  number  of  values. 
For  example,  we  could  restrict  all  the  quantity  of  one  part 
to  be  in  one  storage-type  only,  giving  a strategy  set  of 
size  S1  = number  of  storage-types . In  theory,  we  could  let 
this  discrete  set  be  as  finely  divided  as  required,  so  this 
should  not  be  a major  restriction.)  This  restriction 
actually  has  two  advantages:  (i)  we  can  apply  Theorem 
and  (ii)  it  makes  the  solution  of  (MCA)  simpler. 

Now,  the  conditions  in  Theorem  3-1,  stated 
are  "if  we  reduced  the  limits  vector  c oy 
resource  usage  of  about  12  parts,  the  proole-  » 
feasible".  Since  we  are  dealing  with  5' 
condition  is  highly  likely  to  hold.  Thjs  - 
in  proceeding  as  if  an  optimal  A exists 
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4.3  DISCUSSION  OF  MAIN  ASSUMPTIONS 

As  mentioned  in  the  overview,  we  shall  first  make  some 
simplifying  assumptions,  and  analyze  the  idealized  model. 
Let  W be  a bounded  subset  of  the  A space,  which  will  be 
delimited  later.  The  starting  point  for  our  analysis  is 


[A4.3.1]  Assumption : For  all  A e W,  the  function  u(A)  is 
continuous  and  Frechet  differentiable.  [] 

This  assumption  will  be  relaxed  in  sec. 4. 5.  Note  that 
we  do  not , however,  make  any  convexity  (or  concavity) 
assumptions  as  in  Arrow  et.al.  [A1]  or  Zangwill  [Z1],  In 
view  of  [A4.3.1]  we  will  define  the  Jacobian  of  u(A)  at  any 
A"  6 W by  the  matrix  A(A"),  that  is 


[D4.3.2]  De  f in  it  ion 


Ai  j > 


8 iHi 
»AX4 


A = A" 


[] 


[L4.  3.  3]  Lemma  (Singularity  of  PO:  For  any  A"  <2  W we  have 

A ( A " ) A"=  0 (] 


Proof : If  all  costs  are  increased  in  the  same  proportion, 

then  from  (MCA)  no  allocation  will  change,  that  is 

u(A"  + hA")  = u(A") 

and  since  this  is  true  for  arbitrary  h,  the  directional 
derivative  of  u(A")  in  the  direction  A"  must  be  zsru.  L J 

[C4.3.4]  Corollary : The  FI-dimensional  function  u(  . ) of  the 
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R-dimensional  variable  X is  (at  most)  an  R-1  dimensional 
surface  (in  the  u space).  This  can  also  be  seen  from  the 
fact  that,  since  the  scale  of  X is  arbitrary,  the  surface  u 
can  be  described  completely  by  fixing  one  component  of  X and 
letting  the  other  R-1  vary.  [] 

[ D 4 . 3 . 5 ] Definition  ; The  feasible  region  in  u space  is 
defined  as 

F = {u  ! 0 £ u £ c}  [] 


Since  by  [ A 4 . 3 . 1 ] the  tangent  hyperplane  to  u(X)  exists 
for  all  X 6 W,  we  introduce  the  following  concept: 

[D4.3.6]  Definition  ( Pseudo-Fea  Sibil  ity ) : u(\)  is 

Pseudo-Feasible  (PF)  w.r.t.  F at  X = X" , if  the  tangent 
hyperplane  to  u( . ) at  X"  passes  through  the  region  F (see 
Fig. 4-1).  [] 

In  terms  of  this  concept  we  have  our  next 

C A 4 . 3 . 7 3 Assumption  : u(X)  is  PF  w.r.t.  F,  for  all 

x e w.  U 

The  motivation  for  this  is  that  a fir3t-order 
approx imation  to  u(X)  should  have  a non-empty  intersection 
with  the  feasible  region.  This  assumption  will  be 
considerably  relaxed  in  sec. 4. 6. 
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Fig . 4-1 : Illustration  of  Pseudo-Feasibility. 
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We  shall  also  simplify  the  system  configuration  with 
[A4.3.8]  Assumption : The  matrix  A satisfies  A..<0  and 

J J 

A i j >0  (i*j).  [] 

[R4.3.9]  Remark : (This  assumption  will  be  removed  in 

[4.5.28]  — it  is  required  for  the  Selection  Algorithm  only, 
not  for  our  general  convergence  proof.)  In  any  case  A.,  must 

J J 

be  non-positive  (explained  below).  Our  assumption 
strengthens  this  to  strictly  negative.  The  two  conditions 
on  A are  part  of  the  conditions  that  make  ( -A ) an  M-matrix 
(see  Lemmas  [L4.4.13]  and  [L4.4.15]).  An  example  of  a 
matrix  satisfying  A^ >0  would  be  in  a system  with  storage 
constraints  only;  or  alternatively  a system  in  which  we 
could  identify  one  critical  resource  in  each 
storage-type* . [] 

[4.3.  10]  Summary  of  First  Set  of  Assumptions : For  the 

convenience  of  the  reader  we  list  these  briefly  below. 

[ A 4 . 3 . 1 ] u(X)  continuous,  differentiable. 

[A4.3.7]  Pseudo-Feasibility. 

[A4. 3. 8]  A . .<0  and  A.  .>0. 

j <J  ^ J 


•The  reason  for  this  is  as  follows:  From  (MCA),  an  increase 
in  X.  (with  all  other  X.  constant)  cannot  cause  parts  not 
using ^resource  j to  decideto  U3e  it,  and  in  fact  it  may 
cause  some  parts  using  resource  j to  move  to  another  area. 
Thus  we  have  Ajj£0  and  Aj.>0  (i*j). 
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4.4  THE  SELECTION  ALGORITHM 

4.4.1  Outline 

The  purpose  of  this  algorithm  is  to  derive  a value  for 
AA , the  change  in  A for  the  next  iteration  (see  [4.2.2]). 
The  algorithm  does  this  by  using  the  information  resulting 
from  the  current  iteration,  i.e.  u(A)  and  A(A)  . (We 
assume,  for  the  moment,  that  A(A)  is  known.) 

Let  us  say  we  have  a A=A"  such  that  u(A")  & F.  We  are 
looking  for  a change  Au"  such  that 

u( A " ) + Au"  < c 

or  to  a first  order  approximation  (using  [A4.3.1]),  we  want 
AA  such  that 

u(  A" ) + A ( A" ) AA  < c 

(For  notational  simplicity  we  sha 11  drop  the  dependence  of  A 
on  A".)  Defining  Au=c-u(A")  we  want 
[4.4.1]  A AA  < Au 

We  need  a method  to  solve  this  inequality,  and  in  fact 
to  choose  from  all  the  possible  solutions  for  AA.  (The 
existence  of  a solution  is  guaranteed  by  [A4.3.7].)  Clearly 
we  should  not,  in  general,  try  to  solve  the  equality 
A AA  = Au.  (This  would  attempt  to  move  to  the  point  where 
all  the  constraints  are  active,  which  may  not  even  be 
physically  rea  l i^ah  l , and/or  may  not  lie  on  the  tangent 
hyperplane:  either  of  these  factors  will  cause  numerical 

problems.)  Our  method  is  to  select  only  certain  components 
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of  X"  to  be  changed,  and  to  look  at  the  equation 

. sA, s . s 
A AX  = Au 

5 

where  AA  is  a vector  containing  only  the  selected 
components  of  AA,  similarly  with  Au  , and  A is  a submatrix 
of  A containing  only  the  rows  and  columns  corresponding  to 
the  selected  components.  We  solve  this  for  AX3= ( A3 ) ~ 1 Au3 . 
Then,  in  the  full  vector  AX,  we  set  all  selected  components 
to  their  cor respond ing  values  in  AX  , and  all  unselected 
ones  are  kept  zero.  We  check  to  see  whether  AAA  is  indeed 
< Au.  If  not,  we  modify  the  selection  to  include  other 
components,  and  repeat  the  procedure. 

The  Selection  Algorithm  is  described  in  detail  in  the 
next  section.  Its  idea  is  to  find  a solution  to  [4.4.1] 
which  is  satisfactory  (in  some  sense),  while  at  each  stage 
exerting  the  minimum  effort  that  appears  necessary.  For 
example,  in  a typical  operation  of  the  Algorithm  with  R=24, 
we  may  find  we  have  to  invert  a 4x4,  then  6x6  and  then  10x10 
matrix,  at  which  point  a suitable  AX  is  found.  This  takes 

■j  q -3 

(4  J+6 -’♦1  0J  )k  =1230k  operations  (say),  whereas  inverting  a 
24x24  matrix  takes  24^k  s13,824k  operations.* 


•Strictly  speaking,  inversion  of  an  Nx N matrix  takes  of  the 
order  of  N 2 - 8 1 operations  [A2].  However,  this  assumes 
implementation  of  Strassen's  multiplication  algorithm,  which 
is  seldom  the  case  in  practice.  Therefore  we  assume  the 
more  usual  n3  value. 


# . 
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The  power  of  the  Selection  Algorithm  however,  stems  not 
so  much  from  its  computational  savings  as  from  its 
properties  which  are  described  by  the  next  result. 

[T4.4.2]  Theorem  4-1 

Let  the  Assumptions  14.3-  10J  hold  for  X = X" , and  suppose  that 
u(X")  £ F.  Then  if  X">0  we  have 

[T4.4.3]  The  Selection  Algorithm  (see  next  section) 
terminates  before  all  R components  of  X"  are 
selected  , 

[T4.4.4]  The  AX"  so  found  satisfies 

AX"  = arg  min  { ||AX||  | u( X " ) +A ( X" ) AX  < c } 

AX  >0 

where  ||^||  can  be  either  the  Euclidean  Norm 

()~y^)0'^,  or  the  1^  Norm  ( m?x|yi|  ). 
iTi 

[] 


Proof:  See  next  section. 

Remarks : There  are  two  properties  of  interest  in  [T4.4.4] 
above.  The  first,  that  the  Algorithm  gives  AX"  0.  This 
implies  that  if  we  had  X">0,  we  can  be  sure  that  X"+AX">0 . 
This  is  an  important  condition  for  the  next  application  of 
the  Selection  Algorithm,  and  for  several  of  our  other 
results  (sec. 4. 4. 6 and  4.5).  The  second  appealing  property 
is  that  among  all  suitable  AX  > 0,  it  finds  the  one  of 
minimum  norm.  This  will  be  important  for  our  convergence 


proof  in  Theorem  4 —I  I . 
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4.4.2  Description  Of  Selection  Algorithm* 

The  description  of  our  method  is  greatly  facilitated  by 
the  concept  below. 

[D4.4.5J  Definition : A selection  vector  s is  defined  as  a 
vector  of  0's  and  I's  only.  Let  s be  an  R-d imensional 
selection  vector,  with  n components  equalling  1,  and  let 
J3= { j 1 , . . . , j n ) be  the  set  of  indices  of  these  components. 
For  any  R-d imensional  vector  ^ we  define  the  operation  of 
selection  ( // ) 

[4.4.6]  s(#)£=[y.  ]'  [] 

Conceptually,  s(//)^  "selects"  only  those  components  y^ 
for  which  s^=1,  and  "deletes"  all  others.  Similarly,  for  an 
RxR  matrix  A,  s(//)A  "selects"  only  those  elements  A^  for 
which  s^  = 1 and  s^  = 1.  Thus  s(7)A  is  the  submatrix  of  A 
obtained  by  deleting  all  rows  i with  s^=0,  and  all  columns  j 


with  sj=0* 

Formally , 

[ D 4 . 4.7] 

s(,7)A  S 

rA 

.ji  ji 

A . 

. J 1 Jn 

A • . • • • 
JnJ  1 

A . 

Jn  Jn 

*The  details  of  this  algorithm  are  not  essential  to  the 
understand ing  of  the  rest  of  this  Chapter.  The  reader  may 
omit  this  section  if  desired.  Tne  properties  of  the 
algorithm  in  Theorem  4-1  however,  are  important,  and  should 
be  noted. 
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We  also  define  a "reverse"  operation  to  selection  (for 
vectors  only),  as  follows: 

[D4.4.8]  Definition  of  Distr ibution  Operation  (*):  For  s 

and  n as  above,  and  for  an  n-dimensional  vector  z,  we  define 
^"=s(*)z  to  be  an  R-dimensional  vector  such  that 

(i)  z = S(//)^" 

(ii)  If  Sj=0  then  y 'J  = 0 [] 


[4.4.9]  Examples : s= 

r ^ 
i 

y= 

’52 

As 

2 5 6 

0 

12 

4 8 3 

1 

- . 

L18J 

7 8 1 

Then 


s(  If ) y= 


52 
1 8 


and 


s(#)As 


and 


s(  * ) [ s(  If  )y  ] = 


52 
0 
J 8 


4.4.3  Details  Of  Selection  Algorithm 

We  are  trying  to  find  a AA  satisfying  the  inequality 
[4.4.1].  Armed  with  our  new  notation,  we  can  express 
concisely  our  algorithm  for  finding  such  a AA  ( a detailed 
explanation  follows  the  algorithm). 


I 


-90- 


[4.4.  10]  Algorithm  (Selection  Procedure) 


DEFINE:  A = A(A")  an  Rx R matrix  (constant) 

Au  = £-u ( A " ) an  R-vector  (constant) 
s = an  R-vector  (variable) 

AX  = an  R-vector  (variable) 

Au"  = an  R-vector  (variable) 


INIT-S: 


for  r=1  to  R 
do  if  Au  <0  then 
r else 


end 


= 1 
= 0 


CALC  : set  AX  = s(  * ) [ ( s( // ) A ) ~ 1 ( s( // )Au)  ] 


CHECK  : set  Au"  = AAA 

if  Au"  < Au  then  STOP 


NEXT-S:  for  r=1  to  R 

do  if  sp=1  then  nothing 

else  if  Au">Au  then  s =1 
. r r r 

end 

go  to  CALC 

CJ 


4.4.4  Explanation 

In  the  initial  stage  (INIT-S)  the  algorithm  sets  £ to 
select  only  those  components  of  Au  which  are  negative,  say 
Au  =s(//)Au.  These  components  correspond  to  the  violated 
constraints.  It  then  uses  the  correspond ing  submatrix  of  A, 
say  A3  = s(//)A,  to  solve  the  linear  equation 
[4.4.11]  ASAAS  = AuS 

The  resulting  AAS  is  filled  out  with  zeroes  in  the 
unselected  components  to  get  AA=s(*)AA  in  the  step  CALC. 
Now  the  algorithm  sees  whether  this  AA  is  satisfactory 
(CHECK).  The  method  in  CALC  assures  us  that  any  selected 
component  r will  always  satisfy  Auj[  = Aur  (proved  below),  so 
the  check  need  only  be  done  for  the  remaining  components. 
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Any  violations  cause  their  respective  components  to  be 
included  in  the  next  selection  (in  addition  to  the 
previously  selected  components)  and  the  process  is  repeated. 
The  termination  of  the  algorithm  will  be  proved  below. 
First  we  illustrate  the  algorithm  with  a simple  example. 


4.4.5  Example 


Let  all  terms  be  as  in  DEFINE  of  [4.4.10]. 


Given  A"= 

1 

and 

A = 

'-4 

2 

2 

0 

, c = 

'l  0 

, u ( A" ) = 

20 

1 

3 

-6 

2 

1 

10 

7 

1 

0 

1 

-2 

1 

10 

5 

l1) 

0 

1 

2 

-3 

10 

4 

Note  that  our  example  satisfies  A.A"  = 0,  as  must  be  the  case 
by  [L 4. 3.3J.  In  DEFINE,  we  also  calculate 


Au 


[-10,  3,  5,  e; 


Then  the  algorithm  proceeds  as  follows: 

INIT-S : This  sets  s=  [ 1 , 0,  0,  0]'. 

CALC  : (Terminology  as  in  sec.4.4.4  above.) 

Au3=[-10],  A3=[-4].  Hence  4As=(AS)-1a 
and  AA=[2. 5,  0,  0,  0]  ' . 

CHECK  : Au"=AAA=[-10,  7.5,  0,  0], 
which  is  not  < Au  (above), 
so  the  algorithm  continues. 

NEXT-S:  Sets  s = [ 1 , 1,  0,  0]’  . 

CALC  : Au3=j’-10j,  A3=  j"-4  2j  =>  AA3=  pj 
Hence  AX  - [3,  1,0,0]'. 

CHECK  : Au " =[-10,  3,  1,  1]',  which  is  < Au, 
so  the  algorithm  terminates,  with 
AA  = [3,  1 , 0,  0] ' . 


This  step  sets 
u3= [2 . 5 ] , 
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(Note  that  in  each  step  CHECK  above,  the  selected  components 
of  Au  exactly  equal  those  of  Au"  , as  will  be  proved  below.) 

[] 

4.4.6  Proof  Of  Theorem  4-1 

The  proof  is  by  way  of  a series  of  Lemmas. 


[L 4. 4. 1 2 ] Lemma  (Equality  of  Selected  Components ) : In  the 

step  CHECK  of  the  algorithm  [4.4.10]  we  have  Au'[]  = Aur  for  all 

r such  that  s = 1 . [ ] 

r 


Proof : (Notation  as  in  sections  4.4.3  and  4.4.4  above.)  Let 

J and  n be  as  in  [4.4.5]  that  is,  r6J  <=>  s =1.  From 
s s r 

[4.4.11]  we  have,  for  i 6 Jg 

au\ 

k=1 

Since  Au" =AAA  = A [ s( * )AA3] , we  have  for  i 6 Js 


UR 

Au--  - pA..AA. 


jej 


= Au? 


XlAijAXj  + rAij.O  using  [D4.4.8] 


j6J 


[ ] 


For  the  next  Lemma  we  shall  need  the  following  result: 


-93- 

[4.4.13]  Hawkins-Simon  Conditions*  (see  for  example  Nikaido 

[M2]):  Consider  the  system  of  equations 

[4.4.14]  D X = b 

wnere  D is  a square  matrix  with  nonpositive  off-diagonal 
terms.  Then  the  four  conditions  below  are  all  equivalent**: 

(I)  [4.4.14]  has  a solution  X >0  for  some  b >0. 

(II)  [4.4.14]  has  a solution  X >0  for  any  b >0. 

(III)  The  leading  principal  minors  of  D are  positive. 

(IV)  All  principal  minors  of  D are  positive. 

[ ] 

[L4.4.15]  Lemma  (Modified  Hawkins-Simon  Condition) : Let  D 

be  as  in  [4.4.13],  and  define  the  condition 

(I')  The  diagonal  terms  of  D are  strictly  positive, 
anc  [4.4.14]  has  a solution  X >0  for  some  b >0. 

Then  < I ' ) =>  { (I ) , (II ) , ( III ) , (IV)  } [] 

Proof : It  is  sufficient  to  show  (I')=>(III),  since  the  rest 

follows  from  [4.4.13].  Our  proof  follows  Nikaido's  proof  of 
(I  ) = > ( 1 1 1 ) [N2]  with  appropriate  modifications  for  condition 
(I').  Let  X be  n-d imensional . The  proof  is  by  induction  on 


•Matrices  satisfying  these  conditions  are  also  known  as 
M-matr ices  . 

**The  reader  should  note  carefully  the  differences  (i.e. 
> or  > ) in  the  various  inequalities  in  ( I ) . ( 1 1 ) , ( I ' ) . 
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n.  If  n=1,  then  (III)  is  true  since  D^>0.  Now  assume  that 
for  n>1,  (I ' )=>(III)  is  true  for  n-1.  Let  us  subtract 

4.  L— 

Di i /D i i times  the  first  row  from  the  i & one  in  [4.4.14], 
giving 


[4.4.16-1]  D i -|A i +D  12^2  + * • • +D  In^n  = &1 
[4.4.16-2]  D$2A2+. • .+D2nXn  = bj> 


[4.4.16-n]  Dj!}2A2  + . • • +Dfln^n  s bft 

where  (for  i,j=2,...,n)  we  define 

D*r-Dij-D  i 1 D i j/D  ii,  so  that  Dfj£0,  i^j 
bf  =bi  -Dj.ibj.  /D  n , so  that  bf 

This  means  that  the  n-1  dimensional  system  of  equations 
[4.4.16-2]  to  [4. 4.  16  — n]  satisfies  the  conditions  (I'),  so 
that 


D?2  •• 

D2k 

D&2 

• • 

• • 

• D$k 

for  any  k,  (2£k<n) 


Hence,  by  the  familiar  rules  of  determinants, 


Dn  • • • D1k 
d21  •••  D2k 

II 

Dn  d12  ...  D1k 
0 d£2  ...  Djk 
• • • • • • 

=D  1 1 

d?2  • • • D2k 
• • • • • 

Dk 1 •••  Dkk 

0 D$2  . . . D£k 

°&2  •••  Df?k 

> 0 for  any  k,  (2£k£n) 
[] 


so  that  (I')=>(III)  is  true  for  n. 
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IZ 


[L4.4.17]  Lemma : Let  A=A(X")  where  X">0.  Let  As  be  any 

5 

selected  matrix  of  A (i.e.  A = s(  // ) A ) of  dimension  n<R. 
s 

Then  for  any  X of  dimension  n 

ASXS<°  =>  Xs>0 

and  ASX s<0  =>  XS>0  [] 


Proof : Let  us  reorder  the  components  of  A such  that  s 

selects  the  first  n.  We  have  from  [L4.3.3]  that  AX"=0. 
Thus 


[4.4.18]  A X"  = f~  ( - A . ) X ' ! 
j=1  J J j = n+ 1 J J 

Now  from  [A4.3.8]  we  have  A..<0  and  A.  .>0  (i/j).  Thus  since 

J J ^ J 

X">0,  for  1£i£n  the  RHS  of  [4.4.18]  is  nonpositive,  so  that 
ASX"S<0  where  X"S£s(//)X",  or 

[4.4. 19]  (-A3)  X"S  > 0 

Thus  [4.4.19]  satisfies  condition  (I')  of  [4.4.15],  which 
implies  from  condition  (II)  of  [4.4.13]  that  for  any  Xs, 

( -As )XS  >0  =>  XS>0 

which  proves  the  first  statement  of  the  Lemma. 


Now  let  A3Xs=b<0  (strict).  This  implies,  a fortiori, 
that  Xs>0.  Then  for  i<n, 


j = k 


A“Al  bl  j=in^i)  AijAj 

Since  bi<0,  A-^CO,  A ^ ( i ^ j ) > and  X3>0,  we  have  X^>0. 

[] 
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[L4.4.20]  Lemma  ( Non-negat i v i ty  o f AM:  At  all  stages  of 
the  Algorithm  [4.4.10],  provided  s selects  less  than  R 
components,  the  step  CALC  results  in  AX  >0.  [] 

1 2 

Proof : Let  s ,s  , . . . be  the  successive  selection  vectors 

tried  in  the  algorithm.  We  can  renumber  the  components  of 
Au  so  that 

k k 

s selects  components  1 to  m only. 

Clearly  rn1<m^<...  (from  [L4.4.12]  and  step  NEXT -3).  Now 

denote  the  intermediate  stages  of  CALC  as  follows: 

Auk  = sk(//)Au 

Ak  5 sK(//)A 

AXk  £ (Ak)_1Auk 

so  that  the  step  CALC  can  be  written 

AA  = sk(*)AXk 

At  the  first  selection  (INIT-S)  we  have 

A 1 AX 1 =£u 1 < 0 

which  implies  AX ^ >0  using  [L4.4.17].  Hence  AX  >0  for  k=1. 

u 

We  will  now  prove  A_X  >0  by  induction  on  k. 

Suppose  AX  >0,  so  that  AA  >0,  but  the  check  fails  (step 

k+ 1 

CHECK),  leading  to  a new  selection  vector  _s  (step 

NEXT-S),  and  then  in  CALC  a new  value  of 
[4.4.21]  AAk+1  = (AK  + 1 )"1Auk‘"1 

k + 1 

(Note  that  we  no  longer  have  Au  £0  so  tnat  we  cannot  use 

[L4.4.173  as  we  did  above).  Let 

v = sk+1(/))  [sk(*)AAk] 
k+  1 


w = AA 


-v 
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Then  by  definition  of  v and  AuK  * 

j<mk  =>  v.=AAk 

J J 

j>mk  =>  v.  = 0 

sJ 


j<mK  =>  Auk+1=Auk 
J J 


From  [4.4.21]  we  have 


[4.4.22] 
For  i<mk 


Ak'*’1w  = Au  k + 1 -A  k 1 v = b (say) 


.k+1 

bi=Aui  + 1-  ’jTA^Vj 


.K  .k 


Au.  - T Ak.AAK 
i ij  J 

J=1 


(by  calculation  of  AA;. 


And  for  m <i<m 


. k+ 1 j = +L  .k+1 
b.=  Au.  - Y A..v. 
i i i—iJJ 

J=1 

= Auk+1-[A(sk(*)AAk)]i 
= Auk+1  -Au'? 


since  these  components  failed  the  check  in  step  CHECK.  We 

thus  have  t5  £0  in  [4.4.22],  Now  if  mk+^<R  then  from 

[4.4.17]  it  follows  that  w £0.  Using  the  strict  inequality 
k k+ 1 

on  b^^  (m  <i£m  ) above,  and  proceeding  as  in  [4.4.17],  we 
can  show  that  w^>0  for  (mk<i£mk+1).  Hence 


k 1 is 

for  i£m  AA*  rv^+w^^sAA^+w^O,  and 

k k+1  k+1 

for  m <i<m  AA.  =v.+w.=  0 +w.>0, 

— ill  l 


k+  1 

so  chat  AA  >0, 
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[L4.4.23]  Lemma  (Termination  of  Algoritnm) : The  selection 
algorithm  [4.4.10]  terminates  before  all  R components  are 
selected . [ ] 


Proof : First  we  shall  prove  that  the  algorithm  cannot  lead 
to  the  selection  of  all  the  R components.  The  proof  is  by 
contrad iction . 


With  notation  as  in  the  preceding  Lemma,  suppose  that 

k 1 k 

we  have  s and  s such  that  m <R  and 

[4.4.24]  mk+1=  R 
k+ 1 

so  that  s selects  all  the  components.  Now  with 

k k* 

AX  S sk(»)ax 

Au"  £ A AX 

we  must  have  Au'J=Au^  for  1^i£mk  (by  [L4.4.12])  and  also 

[4.4.25]  Au'?>Au.  for  li/xiXR 

since  these  components  were  selected  in  step  NEXT-S. 


I 


Now,  from  [4.3.7]  we  know  there  exists  some  v such  that 
A v_  < Au.  Let  w=^>bX"  where  b is  a scalar  and  X"  is  as  in 
[L4.4.17].  Then  (using  [L4. 3-  33)  Aw  = Av^  < Au.  Since  X">0, 
we  can  also  choose  b such  that  wo=0. 


D 1 

Define  s to  select  the  first  R-1  components.  Then, 
with  the  usual  notation,  we  have 

AR ~ 1 AX R ~ 1 > AuR ” 1 (since  AXR=0,  and  using  [4.4.25]) 

.R-1  R-1  . , R-1  , , 

A w v Au  (since  wR  =0 ) 

. .R-1 , R-1  .R-1 . . . 

=>  A ( w -AX  ) < 0 

. R-1  . ..R-1 

s>  w > AX 


( using  [ L 4 . 4. 17  3) 
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> 0 (since  AA >0  ) . 

Now  let  Au*=Aw.  Then,  using  the  preceding  steps,  and  the 
fact  that  ARj>0  for  j<R  (from  [AM. 3.8]),  we  have 

i-R  R — 1 R— 1 

^uR  = ll  AR  j wj  = TL  ARjWj  > r_ARj^j 
j=1  j=1  j=1 

JfR 

= ) A R j AA j > AuR  (using  [4.M.25]) 

J="l 

which  contradicts  Aw  < Au.  So  [4.M.2M]  cannot  be  true. 

Since  m is  strictly  increasing  in  k,  but  m <R,  the 
algorithm  must  terminate.  [] 


[L4.4.26]  Lemma  (Minimum  Norm  Prot 


Let  AA"  be  the 


value  of  AA  when  the  Algorithm  [4.4.10]  terminates.  Then 

AA"  = arg  min  { ||AA||  ! AAA  < Au  } 

AA  >0 


Proof:  Suppose  the  algorithm  terminates  when  k (as  in 
preceding  Lemma)  equals  n.  We  then  have  found  a AA  such 
that  AAA  < Au,  with  AnAAn=Aun  (notation  as  before). 


Now  consider  any  w >0  such  that  Aw  < Au . Then  for 


. x n 
i<m  , 


jsm  j = R 

rAu“jiAv  l #uwj 


, n , 
j = m +1 

< Auj^  for  i*j) 

=>  Anwn  < Aun 

=>  An(wn-AAn)  < 0 

= > «n  > Aa"  using  [ L 4 . 4 . 17]. 
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Also,  for  i>mn  we  have  AA^=0,  so  tnat  f°r  all 

1_<i<R,  from  which  the  minimum  norm  property  is  clear 
( irrespective  of  which  of  the  two  norms  in  Theorem  4-1  is 
used ) . [ ] 


The  last  two  Lemmas  conclude  the  proof  of  Theorem 
4-1.  [] 


4.5  ITERATION  SCHEMES  FOR  THE  IDEALIZED  PROBLEM 
4.5.1  Restricted  System  Configuration 

Retaining  for  the  present  the  Assumption  that  u(A)  is  a 
continuous  function  of  A whose  Jacobian  A(A)  can  be 
calculated,  and  also  that  each  area  uses  its  own  resource, 
we  can  hypothesize  the  following  iteration  scheme  to  solve 
the  Artificial  Problem  [2.6.1]: 
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[4.5.  1 ] Algorithm  ( SALA -1  Iteration  Scheme) : 

INIT  : Given  some 

Set  \o-xinit 

Set  k=0 


MCA 

TEST 

JACOBIAN 

SELECTION 

UPDATE 


Use  A^  to  do  a Min-Cost  Allocation  [4.2.3] 
for  each  item,  and  calculate  uiAk) 

If  u(Ak)  e F then  STOP 

Calculate  A(Ak) 

Use  Selection  Algorithm  to  calculate  AAk 

Set  A**1 =Xk+AAk 
Set  k = k+1 
go  to  MCA 


The  next  theorem  will  deal  with  the  convergence 
character istics  of  the  above  scheme. 


4.5.2  Definitions  And  Assumptions 

In  this  section  we  perform  the  groundwork  necessary  for 
a convergence  analysis. 


[D4.5.2]  Definitions: 


d ( u , F)  = min  ||  u-u"||  [ ] 

u"6F 


[4.5.3]  Remarks:  The  norms  on  u and  A are  the  usual  1 


norms,  with  the  factor  1/c.^  included  to  make  the  norm 
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independent  of  the  scale  of  measurement  of  the  ith  resource. 

It  is  easily  verified  that  with  the  1„  norm  on  AX  (as  in 

oo  — 

[T4.4.4]),  the  above  definitions  satisfy 

[4.5.4]  Au=AAX  =>  ||Au||  < ||  A||  .||  AX|| 

The  definition  of  d(.,.)  is  the  usual  distance  from  a point 

to  a set.  We  note  that  if  u £ F 

~ u. -Cf 

[4.5.5]  d(u,F)  = min  ||  u-u"||  = max  -- 

u"eF  ~ { i | u . >c  . } c 

11  [ ] 

[A4.5.6]  Assumption : u(X)  is  Pseudo-Feasible  w.r.t.  pF  for 
all  X 6 W,  where  p< 1 . (In  this  case  we  say  u ( . ) is  Strictly 
Pseudo-Feasible,  or  SPF.  This  is  a stronger  restatement  of 
[4.3.7].)  [] 

[A4.5.7]  Assumption : For  all  X,X"  6 W, 

||  A(X)-A(X")  ||  < D * ||  X-X'*  ||  (D  is  some  constant) 

[] 


[4.5.8]  Remarks : [4.5.6]  assures  us  that  the  tangent 
hyperplane  enters  the  interior  of  F,  wnile  [4.5.7] 
effectively  bounds  the  second  derivatives  of  u(.).  We  may 
draw  a parallel  with  convergence  proofs  of  Newton’s  method 
for  finding  the  root  of  a vector  function  [L4,L5J.  In  such 
proofs  it  is  customary  to  see  [4.5.7],  while  our  [4.5.6]  is 
analogous  to  the  assumption  there  that  the  gradient  of  the 


function  is  non-ze^o  at  the  root 


[ 3 
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[D4.5.9]  Definitions : 


a.  T (A ) 

= min 

{ 

||AA||  ! 

u(  A ) + 

A ( A 

)AA  < c } 

AA  >0 

b.  T" ( A ) 

= min 

{ 

I|AA||  : 

u(A ) + 

A(A 

)AA  < pc  } 

AA  >0 

where 

P 

is  the 

value 

in 

[ A 4 . 5.  6]. 

c.  SEL(A)  = any  A_A  minimizing  [D4.5.9a]  above. 

(The  notation  reminds  us  that  such  a value 
is  generated  by  the  Selection  algorithm.) 

[] 

[4.5.10]  Remark : The  definitions  of  T(.)  and  T" ( . ) are 
motivated  by  Theorem  4-1  (see  [T4.4.4]).  We  use  the  1 

oo 

norm  on  AA  so  that  relation  [4.5.4]  is  valid.  Note  that 
T(.)  and  T" ( . ) are  sc al ar s while  SEL(.)  is  a vector . [] 

[D4.5.11]  Definition : W"  J W/1U  ! u(A)>  c}  [] 

[4.5.12]  Remark : W"  is  that  region  of  W for  which  u(A)  is 
not  in  the  interior  of  F.  [] 

[L4.5.13]  Lemma : For  A 6 A",  T”(A)  and  ||A(A)||  are  nor.-zero 
and  bounded  above.  [] 

Proof:  (see  next  section). 


[D4.5.14]  Definitions: 

I = A(A) 

e « sup  T'-(Aj 

* = Aew  -flp" 


[4.5.15]  Remark : It  is  in  view  of  the  previous  Lemma  that 
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we  can  define  the  above  quantities,  since  we  are  assured 
that  both  quantities  exist  and  are  non-zero.  [] 

[4.5.  16]  Surnmar y of  Assumptions  ( Second  Set ) : The  reader 

should  compare  these  with  the  first  set  of  Assumptions. 

[A4.3.1]  u(A)  continuous,  differentiable. 

[A4.5.6]  Strict  Pseudo-Feasibility. 

[ A 4 . 3 • 8 ] A..<0  and  A > 0 . 

J J ^ J 

[A4.5.7]  Bounded  second  derivatives. 

4.5.3  Convergence  Analysis  1 

[T4.5.17]  Theorem  4 -I I (Convergence  of  SALA -1  Scheme) 

Let  Assumptions  [4.5.16]  hold  for  some  W,  and  let  the 
constants  D,£  be  as  already  defined.  For  some  A°  6 W, 

(X°>0)  let  the  following  conditions  hold: 

(i)  ll^1-^°ll  <a>  where  A 1 = A0+SEL(A°) 

(ii)  e<1,  where  e = atD/2 

Mow  let  W'  = {A  i || A ^ “A °||  <b}  where  b=a/(1-e). 

k 

Then  if  W'  C.  W,  the  SALA-1  scheme  generates  a sequence  {A  } 
which  converges  to  some  A*  6 W'  such  that  u(A*)  & F. 

Furthermore,  the  average  order  of  convergence  of  the 
Algorithm  is  at  least  two.  [] 

Proof:  This  will  be  via  a series  of  Lemmas. 


A 
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Proof  of  Lemma  [ L 4 . 5. 13  3:  This  follows  easily  from  the 


definitions . 

Let 

A"=arg  min 

II  A ( A ) ||  over 

Aew 

" . Since 

u( A " ) > c> pc  , from 

the  SPF  assumption  [A4.5.6] 

we 

must  have 

A (A  " ) >0  (or 

else  [D4.5.9b] 

would  have 

no 

solution)  . 

Secondly,  since 

W 

(and  hence  W" ) 

is  bounded , 

from 

[A4. 5.  7] 

the  finiteness 

o f 

||  A (A  )||  (Aew") 

is  clear . 

Similarly, 

the  SPF  property 

assures  us 

that 

for  all 

A6W"  , there  exists  some  aA  to  solve  the  RHS  of  [D4.5.9b],  so 
T"(A)  exists  and  is  finite  in  W"  . Clearly,  since  in  W"  we 
have  u(A)>pc  and  j|A(A)||  finite,  we  must  have  T"(A)>0.  [] 

1 T (A ) 

[L4.5.18]  Lemma:  For  A e W"  , - < ---------  < t [] 

- a ~ d(u(A),F)  - 

Remar  k : This  will  be  the  key  Lemma  for  our  convergence 
proof,  since  it  will  establish  bounds  on  the  ratio  of  the 
norm  of  AA  to  the  distance  from  the  feasible  region.  [] 

Proof : This  relies  on  the  minimum  norm  properties  of  the 

definitions  above,  and  on  the  SPF  assumption. 

Consider  some  ASW"  and  let  AA"  be  the  minimizer  of  the 
RHS  of  CD4.  5-  9b]  . Then  letting  A_u"  = A(A)AA"  we  have* 

[4.5.19]  jlu'?  < pc^-u^ 

Now  consider  aAA"  as  a candidate  for  solving  the  RHS  of 
[D4. 5. 9a] , where 

•Whenever  there  is  no  risk  of  confusion,  we  shall  denote 
u(A)  simply  by  u. 
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/u . -c . 
inf/  l l 

[4.5.20]  I = l i ! u. -c . >0 } and  a = Jell  - 

1 1 VUi'PGi 

Clearly  , 0<a< 1 . N 

Let  u"  = u( X ) +A  (X  ) [ aAX " ] = u + aAu"  • 

We  will  show  that  u"  G F. 


For  [i  ! ui-pci<0},  u'?  < ui + a(  pc  .^u  i ) 

< u . +pc . -u . 

— l i i 


using  [4.5.19] 
(since  a< 1 ) 


= P<=i  < <v 

For  [i  ! pc^<u^<c^},  u'?  < u^  + aCpc^-u^) 


< c . 
l 


For  i e I, 


u'.'  < u . +a( pc . -u . ) 

l—l  li 

= ui-a(ui-pc.) 

< v'vV 


using  [4.5. 19] 
(by  definition  of  i) 
using  [4.5.19] 


using  [4.5.20] 


Thus  u"6F  and  aAX"  is  a member  of  the  RHS  of  [D4.5.9a]. 
follows  from  the  minimum  norm  property  of  [D4.5.9a]  that 


T (X ) < 


a AX  " 


. inf 

< iei 


= aT" ( X ) 


' u . -c . 

l l 


c . 

1 


f'Ui-pc  i\ 


1 

U,  -C  \ 


T"  ( X ) 


supj  1 1 

i ei \ — 


so  that  we  have 


T"  ( X ) < -----  T"  (X  ) 

inf[Ui“PC^  " ' ^ 


it 


T(  X ) T " ( X ) 

< ~-  < « from  [4.5.14] 

d(  u(X ) , F)  ~ 1 -p  ~ 


which  proves  the  RHS  inequality  in  the  Lemma. 
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Now  let  Au=A(A)AA,  for  AA  minimizing  [DU. 5. 9a]. 

Then,  since  u+Au  6 F, 

d(u,F)  < d(u,u  + Au)  = || Au)|  < ||  A ( X ) ||  .||  AA|| 

= ||A(A)||T(X)  < aT  (A ) 

which  proves  the  LH3  inequality  in  the  Lemma.  [] 

[LU.5.21]  Lemma:  The  sequence  (X  } in  [TU.5.17]  converges 
to  some  X*  6 W ' with  u(A*)SF.  [] 

Proof : The  existence  of  each  X^  , k>0,  is  guaranteed  by  the 

SPF  property  provided  we  can  show  that  the  sequence  remains 
in  W.  Assume  th3t  [X  } satisfies 

[4.5.22]  || 1 1|  < (a/e)e2j_1 

for  j=1 ,2,...  . We  then  would  have 

[4.5.23]  |j Xn-A°||  < r ||X^1-X^||  < (a/e) 

' j=0  j=0 

n- 1 . 

<a  eJ  (since  e<1  ) 
j“0 

£ a/ ( 1 -e)  = b . 

Thus  we  would  have  Xne  W'  c W for  all  n>0. 

To  show  that  [4.5.22]  is  true,  suppose  it  is  true  for 
j = 1 , 2,  . . . , k . Let  u"  £ u(Xk_1 ) + A(Xk_1 )[Xk-Xk_1  ],  which 

implies  u"eF  by  definition  of  SEL(Xk  "*  ) . Also  [4.5.23] 

i k k 

implies  A ew 1 for  j=1,...,k.  Let  u = u(A  ).  We  consider  tne 

k k 

two  cases  u SF  and  u 0F  separately. 
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If  uk  0F  then  Xk6W’C.W  =>  Xk6W"  , so  using  [L4.5.18]  we 

have 

||Xk+1  -Xk||  = T(Xk)  < td(uk,F)  < td(uk,u") 

= t ||u(Xk)  - u(Xk_1)  - A(Xk_1  )CXk-Xk_1  ] || 

< (tD/2)||Xk-Xk-1||2  using  [4.5.7] 

< (a/e) e 

If  ukeF,  clearly  SEL(Xk)=0  so  that  the  following 
inequality  holds  trivially 

Lf+1  \r  pl< 

II X1 K -A  ||  =0  < (a/e)e  . 

Thus  in  either  case  [4.5 .22]  holds  for  j=k+1.  Now 
|| X1  -X°||  < a = (a/e)e2° 

and  so  by  induction  it  follows  that  [4.5.22]  is  true  for  all 
j = 1 , 2 , . . . . 

n- 1 , 

Clearly  |j X n -X^H  < a eJ  , so  that  {X*}  is  a Cauchy 

j = m 

sequence  in  W'  and  converges  to  some  A*6W'.  It  remains  to 
be  shown  that  u(X*)6F.  From  [L4.5.18], 
d(u(X1:),F)  < aT  (.  Xk)  5 a||  1 -*k||  ->  0 

so  that  d(u(X*)  ,F)=0,  i.e.  u(A*)0F  by  closure  of  F.  [] 

[D4.5.24]  Definition  ( Average  Order  of  Convergence  , AOC ) : 

We  use  the  definition  in  [L5,  p.129],  adapted  for  vector 

norms.  Let  X ->  X*.  The  AOC  is  the  infimum  of  the  numbers 

p>1  such  that 

limsup  ||  Xk-xsj|  ^ 1 = 1 
k ->oo  ~~  _ 

where  "limsup"  denotes  limit  superior.  [] 
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[L4.5.25]  Lemma:  The  AOC  of  the  sequence  {Ak}in  Theorem 
4 - 1 1 is  at  least  two.  [] 


Proof : Consider  the  sequence  of  terms  v^Cq)  - [ we ^ ^ ^ 

where  0<w<oo  , and  e<1.  We  have 

1 im  v,(2)  = lim  w^1/2^e  = e < 1 
k ->oo  k->oo 

while  for  q>2, 

1 im  v ( q ) = lim  A 1 > e < 2/t> > k , 

k->oo  k->oo 

so  that  inflq  ! lim  v,  (q)  =1}  is  two. 
q>  1 k ->oo  K 

Now  for  the  sequence  {A  } we  have 

< f. 

j = k 

00  - i 

< ( a/ e)  J~ e * 

j~k 

= (a/e)e2kfe2k(2j-n 

j = 0 

2k 

ae 

<  . 

e(  1 -e) 

„ . a 

Since  we  have  e< 1 and  0<  <co  , by  comparing  the  above 

e(1-e) 

with  the  results  on  vk(q)  we  deduce  that  the  AOC  of  {Ak}  is 
at  least  two.  [] 


The  proof  of  Theorem  4-II  is  now  complete.  [] 
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4.5.4  Remarks  On  Theorem  4-II 
4.5.4. 1 Existence  Of  Optimum  - 

The  reader  may  ask  that,  since  AX  is  always  >0,  how  can 
we  be  sure  that  for  arbitrary  choice  of  A*nit  in  the 
algorithm  [4.5.1],  there  exists  a solution  A*  >_  Ainit?  In 
Chapter  3 we  gave  sufficient  conditions  for  the  existence  of 
some  optimal  A.  In  this  Chapter  we  make  a stronger 
assumption,  the  SPF  assumption,  which  implicitly  assures  the 
existence  of  such  a A.  However,  we  do  know  that  if  such  a A 
exists,  then  there  also  exists  some  A*>0  (strictly)  which  is 
also  a solution  (see  Corollary  to  Theorem  3 —I  I ) . Now  the 
scale  of  A*  is  arbitrary**,  and  only  the  relative  magnitudes 
of  the  components  matter  (see  [4.3.3]  and  [4.3.4]).  Thus 
for  any  Ainit,  there  exists  a b>0  such  that  bA*>^  Aini-t  and 
bA*  is  a solution. 


4. 5. 4. 2 Cond it  ions  In  The  Theorem  - 

These  are  similar  to  conditions  used  in  convergence 
proofs  of  Newton's  method  and  its  extensions  [L4.L5.R1].  In 
fact  our  result  is  similar  to  Robinson's  [R1],  but  our 


**In  Chapter  3,  the  scale  of  A*  was  determined  by  the  choice 
of  the  constants  P*-  in  tKe  Artificial  Problem  [3.2.5]. 
Since  these  constants  are  themselves  arbitrary,  we  are  not 
contradicting  the  statements  in  Chapter  3. 
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method  is  quite  different.  His  proof  assumes  A is 
nonsingular,  which  is  not  so  in  our  case  (see  [L4.3.3]),  and 
he  uses  several  properties  of  convex  processes  CR2, R33-  Our 
proof  on  the  other  hand  depends  mainly  on  Lemma  [L4.5.18], 
whose  proof,  in  turn,  relies  on  the  minimum  norm  properties 
and  the  SPF  assumption.  Intuitively,  we  have  replaced  the 
condition  that  the  range  of  A be  the  whole  space,  by  the 
(weaker)  condition  that  the  range  of  A include  some  point  in 
the  interior  of  the  set  F-u(A)  . 

4.5.5  General  System  Configuration 

In  the  above  convergence  proof,  the  crucial  properties 
used  were  the  minimum  norm  properties  in  [D4.5.9].  It  is 
seen  that  the  restriction  on  the  system  configuration  in 
Assumption  [A4.3.8]  was  to  ensure  these  properties  for  the 
selection  algorithm.  From  the  insight  given  us  by  the  use 
of  these  properties  we  propose  a general  programming 
problem,  the  Minimum  Norm  Problem: 

[4.5.26]  MNP:  Find  AA  minimizing  ||A  A|| 

a.  subject  to  A+AA  Amin 

b.  and  u(A)+A(A)AA  < c 

Here  Amin  is  a given  strictly  positive  vector.  This  1 

— oo 

norm  problem  can  be  solved  quite  simply  using  Linear 
Programming  [Z2],  since  min  |1  AA||  is  equivalent  to  min  y 
subject  to  y> AAi  and  y^-AA  for  all  i.  Thus  this  problem 
can  be  solved  using  known  mecn  J . In  that  case,  we  can 
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general  i ze  our  iteration  scheme: 


[4.5.27]  Algor ithm  ( SALA -2  Iteration  Scheme) : 


I NIT  : 

Given  some  Xrn:*-n>0,  and  some 
Set  X°=Xinit 

Set  k = 0 

Xinit> 

xmin 

MCA  : 

Use  Ak  to  do  a Min-Cost  Allocation  [4 
for  each  item,  and  calculate  u(Ak) 

.2.  3] 

TEST  : 

If  u(Ak)  & F then  STOP 

JACOBIAN  : 

Calculate  A(Ak) 

MNP  : 

Calculate  AAk  as  in  [4.5.26] 

above 

UPDATE  : 

Set  Ak+1  =Ak+AAk 

Set  k=k+1 
go  to  MCA 

We  shall 

now  consider  the  convergence 

character istics 

of  this  scheme. 

[4.5.28]  Third  Set  o f Assumptions : We  can  considerably 

relax  the  restrictions  in  our  second  set  of  assumptions,  to 
get : 

[ A 4 . 3 . 1 ] u(X)  continuous,  differentiable. 

[A4.5.6]  Strict  Pseudo-Feasibility. 

[A4.5.7]  Bounded  second  derivatives. 

[L4.5.29]  Lemma:  At  the  kth  iteration  of  the  algorithm 
[4.5.27],  there  exists  some  AAk  which  solves  MNP.  [] 

Proof : This  will  be  by  induction.  Assume  that  for 
j = 0,  1, . . . ,k-1 , AAJ  exists.  Then,  by  the  properties  of 
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AX 


k-1 


. k .k-1  ..k-1  . . min 

X = X + AX  2 h. 

Now  let  A_u  = £-u ( Xk ) , and  A = A(Xk)  . From  the  SPF 

assumption,  we  know  there  exists  w such  that  Aw  £ Ajj.  Also, 

1/ 

from  [LU.3-3],  AX  =0.  Hence  for  any  scalar  b, 

ACw+bX*)  < Au  . 

Choose  b large  enough  so  that 

. k . ,,k.  . .min 

X + (w+bX  ) > X 

Then  AX  = w+bXk  is  feasible  for  MNP.  Thus  the  set  of 

feasible  solutions  to  MNP  is  non-vacuous.  By  closure  of 

this  set,  there  must  exist  a solution  of  minimum  norm, 
u 

Hence  AX  exists. 


Now,  for  j=0  we  have 

= xinlt  > x1"1" 

so  that  by  the  above  argument  AX°  exists.  Thus  by  induction 


AX  exists  for  k=0,1,2, 


[] 


[DU. 5. 30]  Definition : We  modify  [DU. 5. 9]  to  get 


a . 

b . 

c . 


T (X ) ) 

) 

T" ( X ) ) 

) 


SEL(X) 


Let  these  be  defined  as 
in  [DU. 5. 9],  except  that 
the  minimizations  be 
carried  out  over  all  AX 
such  that 


X+AX  > X 


min 


[] 


[LU.5.31]  Lemma:  The  bounds  in  [LU.5.18]  hold  for  the  new 
assumptions  and  definitions  (i.e.  [U.5.23]  and 

[DU. 5. 30]).  [] 
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Proof : Refer  to  the  proof  of  [L4.5.18].  Ak  below  is  the 

same  as  A in  that  proof.  We  showed  there  that  a^A 11 
satisfied 


u(Ak) 

+ 

A(Ak)[aAA"]  < c 

Now 

A k ♦ AA" 

> A1"10 

by 

[D4.5.  30] 

and 

also  Ak 

> Amin 

see 

[L4.5.29] 

so  A k + aA A"  = ( 1 -a) Ak  + a(Ak  + AX") 

(1-a)A  ♦ a(A  ) 

. min 


so  that  aAA"  satisfies  both  the  inequalities 
[4.5.26a  and  b] . The  remainder  of  the  proof  is  exactly  as 
in  [ L 4 . 5.  18].  [ 3 

[T4.5.32]  Theorem  4 -I  II  (Convergence  of  SALA-2  Scheme) : 

The  statements  in  Theorem  4-II  remain  valid  for  assumptions 
[4.5.28]  and  definitions  [D4.5.30],  with  the  SALA-2 
iteration  scheme.  [] 

Proof : Theorem  4 —I  I was  proved  via  the  following  Lemmas: 

[L4.5.13]  Bounds  on  T"(A)  and  A(A)  . 

[L4.5.18]  Bounds  on  ratio  T(A) /d( u(A  ) , F)  . 

[L4.5.21]  Convergence  to  A*,  with  u(A*)SF. 

[L4.5.24]  Order  of  convergence. 

The  reader  can  verify  that  by  adding  [L4.5.29]  to  the  above 
3ct  of  Lemmas,  and  lepiacing  [L4.5.18]  by  [L4.5.31],  the 
proof  of  Theorem  4 — 1 1 1 is  complete.  [] 
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4.6  EXTENSION  TO  REALISTIC  CASE 

In  this  section  we  relax  several  assumptions,  in  order 
to  extend  our  results  to  more  realistic  cases.  First  we 
shall  find  an  alternative  to  the  continuity  assumption 
[ A 4 . 3 • 1 ] • Next  we  shall  consider  the  problem  of  defining 
and  calculating  a suitable  replacement  for  the  Jacobian 
A( . ) . Then  we  shall  considerably  weaken  the  SPF  assumption, 
to  one  which  is  likely  to  hold  in  practice.  We  shall  use 
all  these  ideas  to  propose  a modified  iteration  scheme,  and 
then  prove  convergence  of  this  scheme. 


4.6.1  The  Underlying  Function 

We  now  relax  the  unrealistic  assumption  TA4.3.1],  that  u(X) 
is  a continuous  differentiable  function.  The  sources  of 
d iscon t inui t ies  in  u( . ) are  twofold:  (i)  discontinuities  in 
the  usage  functions  of  the  individual  items,  that  is 
u1(dJ‘,.)>  and  (ii)  an  item  "jumping"  from  one  allocation  to 
another  as  X passes  through  some  critical  value  for  that 
item.  Observe  that,  due  to  the  size  of  the  problem,  the 
resource  usage  of  an  individual  item  will  be  quite  small  in 
comparison  to  the  limits.  Thus  we  postulate: 
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[A4.6.1]  Assumption : There  exists  some  continuous, 
differentiable  R -dimensional  function  u(X)  , which  we  call 
the  underlying  function , such  that 


|QrU)-ur(X)| 

c 


< e <<  1 
u 


and  whose  Jacobian  A(.)  is  such  that  for  all  X.X+AX  e W 

||A(X*4A)-A(X)||  *11  A||  2 
< en  <<  1 

114*11  ' Q 


wnere  ||  A||  is  as  in  [D4.5.2].  [] 


[R4.6.2]  Remark : The  subscript  Q is  for  "Quadratic  error". 
The  terms  in  the  above  definition  have  been  chosen  so  as  to 
make  and  e^  dimensionless,  that  is  independent  of  the 
scale  of  measurement  of  X and  ur(A).  (Compare  e^  with  D in 
[A4.5.7],  and  remember  that  division  by  c^  is  implicit  in 
||  A||  . ) The  above  assumption  is  reasonable  in  practice  --  for 
instance,  if  several  tens  of  thousands  of  items  use  resource 
r,  tnen  the  discontinuities  mentioned  above  will  be  of  the 
order  of  the  usage  by  one  iten,  so  we  can  expect  e^-0.0001  . 
See  Fig. 4-2  for  an  illustration  of  Q(.).  The  assumption 
is  equivalent  to  saying  that  the  character istics  of  the 
items  are  not  bunched  together,  so  that  too  many  items  do 
not  change  their  allocations  for  a small  change  in  A. 
(Without  the  bound  or.  the  second  derivatives  of  Q,  we  could 
always  find  a 0 arbitrarily  close  to  u.)  [] 


Fig . 4-2 : Illustration  of  Underlying  Function. 


Let  Aj  vary,  with  A^Ckij)  constant.  We  can  think  of  u^  as  a 
function  of  A^  alone,  giving  us  a cross-section  of  the  u 
versus  A space  as  above. 
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4.6.2  Calculation  Of  Suitable  Jacobian 


If  we  could  (somehow)  estimate  the  gradient  matrix  A of 
the  above  function  u(  . ) , we  would  be  able  to  apply  our 
algor ithm . Ofcourse  0 is  a hypothetical  function  and  cannot 
be  observed.  We  could  use  finite  difference  methods  on  u(  . ) 
to  approximate  A,  but  such  an  approximation  would  not,  in 
general,  satisfy  the  conditions  that  held  on  A (see 
[L4. 3- 31)  which  were  necessary  for  the  Lemmas  which  proved 
Theorems  4-1  and  4-III  (and  also,  4-II  depended  on  4-1). 


Our  method  is  to  construe t an  approximation  A in  such  a 
way  tnat  the  above-mentioned  conditions  hold.  The  usual 


finite-difference  approximation  would  be  to  let 


Au  ■ 


u . ( A+he~’ ) -ui  (A  ) 


where  e^=  unit  vector  in  the  j direction.  A simple  "trick" 
enables  us  to  construct  a more  satisfactory  estimate  A.  Let 
A*  be  the  current  value  of  A during  an  iteration.  Let 

hj  £ ChX^f,  . . . ,hAk,0,  . . . ,0] ' 
with  h°  = 0 

where  h is  some  constant,  typically  0.1  . Then  we  calculate 


our  estimate  A by 


[4.6.3J 


ui(Ak  + h'^)  - u^A^  + h^'1) 


[ L 4. 6 . 4 i Lemma : The  estimate  A above  satisfies  the  property 
of  A in  [L4.3.3],  that  is 

A(Ak)  Ak  = 0 [] 
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Pr  oof  : n y~  A..(Ak)Ak  = J~  [ ( A^+ji^  ) -u  . ( A^  + h''  ^ ) ] 

J=1  J=1 

= u^(A  + h )-u^(A  + h ) 

= ui(Ak+hAk)-ui(Ak) 

= 0 . 

Ths  final  step  follows  by  the  same  reasoning  as  in  [L4.3-3J 
that  is,  a proportional  change  in  all  the  costs  will  not 
lead  to  a change  in  the  allocation  --  a fact  which  is 
independent  of  the  form  of  the  u(  . ) function.  [] 

[R4.6.5j  Remark : We  see  that  the  above  method  evaluates  A 

k k k 

at.  intermediate  points  along  a path  from  A to  A +hA  . This 

means  that  any  given  term  A.  . is  designed  to  estimate 

->Zi  | 1J 

-rr"  ,*  Li  rather  than  ~ q*  • However,  for  small  h,  we 

i*j  | i 

do  not  expect  the  difference  to  be  significant.  The 
important  point  is  that,  although  both  techniques  involve  an 
estimation  error,  in  the  chosen  method  this  error  does  not 
destroy  a useful  property  of  A.  The  utility  of  the  above 
result  is  that  the  Selection  Algorithm,  if  used  on  the 
estimate  A,  will  still  satisfy  the  statements  in  Theorem  I 
(with  A replaced  by  A),  and  some  of  our  existence  proofs 
(e.g.  [L4.5.29])  for  Theorem  4-III  also  remain  valid.  [] 

In  Theorem  4-IV  we  show  that  (a  slightly  modified 
version  of)  the  SALA-2  iteration  scheme  will  in  fact 
converge  to  a solution,  even  in  the  presence  of  errors 
between  the  estimate  A and  the  "underlying"  gradient  A.  To 
measure  these  errors  we  have 
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[D4.6.6]  Definition:  eA  = max  ||  A ( X ) — A ( A ) ||  *||X|| 

A xew 

where  ||A||  is  as  in  [D4.5.2].  [] 

[R4.6.7]  Remar  k : Tine  term  ||  X||  (and  the  c^  implicit  in 
[D4.5.2])  make  d imension 1 ess  (see  remark  [R4.6.2] 

above ) . [ ] 

C A 4 . 6 . 8 ] Assumpt ion : eA  <<  1 [] 

[4.6.91  Justification : We  show  here  that  the  above 

assumption  is  reasonable.  First  note  that  the  scale  of 

measurement  of  c^  is  arbitrary,  and  all  our  definitions  are 

independent  of  this  scale.  Thus  for  any  given  X,  we  can 

re-scale  the  c and  A (keeping  c X constant,  so  that  no 
r r r r 

allocation  changes)  to  got  each  X ^ = || X||  . Now  is  as  in 

[4.6.31*  Since  the  second  derivative  of  0 is  small,  a good 

estimate  of  A . . is 

^ J * ’1 

a.  (x+hJ ) - a . (A+hJ“  ) 

A.  . = 

hX  . 

J 

2e  c . 

_ u 1 

Then,  using  [A4.6.1],  IA..-A..I  < 

1 J 1 J 1 ~ hX  . 

J 

and  thus  ||A||  -||  A-A[|  5 “fx  ---  f I 

°1  jTl  1J  J 

R 

< (2eu/h)  J-  (||X||  / X j) 

= ( 2 R e u ) / h . 
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Thus  for  e -0.0001  (see  [R4.6.2]),  h-0.1,  and  R-20,  we  have 
u 

eA<0.02  . [] 

4.6.3  Relaxing  The  SPF  Assumption 

Throughout  sec. 4. 5 we  relied  on  the  SPF  assumption 

[A4.5.6].  If  Qk=Q(Ak)  is  quite  far  from  the  feasible  region 

F,  this  assumption  can  easily  be  violated  in  practice  (see 

Fig. 4-3).  This  situation  can  be  further  aggravated  by 

errors  in  the  estimate  A.  Let  us  consider  an  enlarged  set 

of  constraints,  c^r  ( 1 +d'i< ) c where  dk  = d( uk  ,F)  . the  factor 

(1+d*)  is  the  smallest  factor  by  which  F must  be  enlarged  in 

order  that  it  contain  jj  (Fig. 4-4).  Thus  a more  reasonable 

assumption  to  [A4.5.6]  would  be  to  say  that  the  tangent 
k k 

hyperplane  at  u intersects  p(1+d  )F  for  some  p< 1 (Fig. 4-5). 

k 

Since  this  enlarged  region  is  much  "nearer”  to  Q , we  would 

k - k 

also  expect  that  small  numerical  differences  between  (u  ,A  ) 
k * k 

and  (u  ,A  ) will  not  destroy  the  validity  of  this  assumption 
«k  *>a  k 

(we  use  A for  A(A  ),  etc.).  We  now  state  this  formally. 

[A4.6.10]  Assumption : (Numerical  Pseudo-Feasibility,  NPF ) 
k [/  a k 

For  any  A GW , the  hyperplane  u +A  dA  (for  varying  AA)  has  a 
non-empty  intersection  with  the  region  p(1-t-d<)F,  for  some 
p< 1 . [ ] 
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Fig . 4-5:  Illustration  of  the  NPF  Assumption. 

It  is  more  reasonable  to  expect  that  H will  intersect  the 
region  p ( 1 *»-d  )F,  than  to  expect  it  to  intersect  F. 
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4.6.4  Iteration  Scheme  For  Realistic  Case 

We  have  relaxed  several  of  our  unrealistic  assumptions, 
in  order  to  deal  with  practical  situations.  In  terms  of  our 
new  concepts  we  have  our  next  iteration  technique. 

[4.6.11]  Algorithm  (SALA -3  Iteration  Scheme) : 

INIT  : Given  some  Xmin>0,  and  some  Amin 

Set  X°=Ainit 
Set  k=0 

INITMCA  : Use  A°  to  do  a Min-Cost  Allocation  [4.2.3] 
for  each  item,  and  calculate  u(A°) 

Set  u°=u(A°) 

TEST  : If  uk  6 F then  STOP 

JACOBIAN  : Calculate  Ak  = A(Ak)  as  in  [4.6.3] 

DISTANCE  : Set  dk  = d(uk,F) 

Set  £k  = (1 +dk) c 

MNP  : Calculate  AAk  = arg  min  IIAAII  subject  to 

uk+AkAA  < p£k  " 

Xk+AA  > puTT 

LINESRCH  : Find  a = arg  min  d ( u( Ak+aAA k ) , F) 

UPDATE  Set  uk+1  = u(Ak+aAAk) 

Set  Xk+1 =X^+a4Xk 
Set  k=k+1 
go  to  TEST 

[R4.6.12]  Remark : The  numerical  computation  of  A can  be 
done  quite  efficiently.  If  the  effort  involved  in 

evaluating  the  vector  u(A)  is  T units  of  CPU  time,  from 
[4.6.3J  we  see  that  A(A)  can  be  calculated  in  at  most  RT 
additional  units  of  time.  In  practice,  since  many  data 
items  calculated  for  u(A)  can  be  used  for  calculating 
u(A+h^),  the  additional  effort  is  less  than  RT  units  of 


-126- 


time.  We  have  also  added  a Line  Search  (LINESRCH)  as  in 
most  implementations  of  Nonlinear  Programming.  This  can 
also  be  done  using  well-known  numerical  techniques  C L 5 ] . 
Usually  a fairly  crude  search  suffices,  and  does  not  involve 
too  many  evaluations  of  u(  . ) . The  value  of  p has  been  left 
unspecified  in  the  above  algorithm.  It  can  be  either  chosen 
adaptively  during  the  algorithm,  or  kept  as  a user 
parameter,  so  that  the  user  could  "tune"  the  algorithm  for  a 
particular  problem.  (The  adaptive  method  would  start  with  a 
low  value,  and  bring  the  value  closer  to  1 if  either  the  MNP 
or  LINESRCH  steps  were  unsuccessful.) 


4.6.5  Convergence  Analysis 

[A4.6.13]  Assumption : Let  eu>  eg,  e^,  and  p be  as  already 
defined  in  [A4.6.1],  [D4.6.6],  [A4.6.10],  and  let 

b = 2eu  + eft  + (eg/2) 

so  that  b<<1.  Then  there  exists  # with  0<p<1-b-p.  [] 

[A4.6.14]  Assumption : The  AX  calculated  in  MNP  satisfies 

Il4*kll  < IIAkll  • n 

[R4.6.15]  Remark : There  is  a relationship  between  the 


various  assumptions  in  this  analysis.  In  remarks  following 
[A4.6.1]  and  [A4.6.8]  we  have  already  justified  b<<1  in 
practice,  allowing  a choice  of  p close  to  1 in  [A4.6.13] 


-127- 


above.  This  makes  the  NPF  assumption  more  reasonable,  since 

it  brings  p(1+dk)F  very  close  to  uk . It  also  aids  [A4.6.14] 

k 1/ 

since  AX  will  be  smaller  for  p(1+d  )F  close  to  u^  . 

[A4.6.14]  can  be  further  justified.  We  have  already 

remarked  in  [4.6.9]  that  it  is  possible  to  re-scale  the  c r 

so  that  ^rr||^||  • Since  in  MCA,  an  item  decides  its  use  of 

resources  depending  on  their  relative  costs,  we  can  expect 

that  a change  of  the  order  of  ||X||  will  cause  a major 

change  in  the  usage  of  the  jth  resource.  Loosely  speaking, 

the  step  MNP  only  seeks  to  bring  down  the  most  over-used 

resources  by  a factor  of  p (which  is  close  to  1).  So 

||  AXk||  < ||Xk||  is  reasonable.  [] 

[4.6.16]  Summar y of  Assumptions  (Final  Set):  We  have 

considerably  relaxed  all  our  assumptions,  and  in  addition 
shown  why  the  ones  below  are  likely  to  be  true  in  practice: 
[A4.6.1]  Underlying  function. 

[ A 4 . 6.8]  eA  <<  1 . 

[A4.6.10]  Numerical  Pseudo-Feasibility. 

[A4.6. 13]  0 < p < 1-b-p. 

[A4.6.  14]  ||AXk||  < ||Xk||  . 
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[T  4 . 6 . 17]  Theorem  4 -I  V (Convergence  o f SALA -3  Sc  heme ) 

Let  assumptions  [4.6.16]  hold.  Then  the  SALA-3  algorithm 

'[4.6.11]  converges  to  a solution  in  a finite  number  of 

k 

iterations.  Further,  if  d is  as  in  [4.6.11],  and  p as  in 

[4.6.13],  then  the  convergence  rate  is  characterized  by 

k+ 1 

either  d =0  (i.e.  solution  found) 
or  d^*^  ( 1 — ) d ^ - p [] 

[R4.6.18]  Remark : We  note  that  in  the  worst  case,  the 

convergence  rate  is  still  better  than  geometric.  []  \ 

Proof : First  we  note  that,  since  A satisfies  [L4.6.4],  the 

k 

existence  of  a A>.  to  solve  MNP  at  each  iteration  can  be 
proved  in  the  same  way  as  [L4.5.29].  We  develop  the 
remainder  of  the  proof  in  4 stages: 

1.  The  line  search  is  replaced  by  a specific  choice  of 
a.  Clearly,  the  line  search  in  tne  algorithm  will 
always  do  at  least  as  well. 

2.  We  develop  an  error  equation  relating  the  estimated 
resource  usage  to  the  actual  usage. 

3.  We  prove  upper  bounds  on  the  actual  usage. 

k 

4.  We  show  that  d decreases  according  to  the  equation 
abov  e . 


The  details  now  follow. 


-129- 

Choice  of  a 

From  [A4.6.13]  we  can  let  p=1-b-p-q  for  some  q>0.  Let  us 
choose  a such  that 

2eu  + js  . 

[4.6.19]  < a < 1 . 

2eu+jS+q  - 

From  this  we  deduce  that 

a(2eu+p+q)  > 2eu  + p 

=>  a(2e  +1-p-b)  > 2e  + p 

u — u 

= > a(  1 -p-eg/2-eA ) >_  2e^  + p using  [A4.6.13J 

and  thus 

[4.6.20]  a(1-p)  - <p  > 2e  +ae  +ae_V2 

— u a y 

an  equation  we  shall  use  later  in  this  prqof. 

Error  Equation 

Write  jj  for  jj(A)  and  u3  for  Lj(A  + aAA)  , for  simplicity. 
Define  Au  = A(A)AX,  so  that  u + Ajj  £ pck  by  MNP.  Also  let 
y = II  a*  II 7 II  All  <1  by  [A4.6.14],  Then  we  have 

3 k 

u.  = u.(X  + aAX)  < Q . ( X+aAX ) +c . e using  [A4.6.1]  and  c >c 

L 1 — ~ ~ x ““  x 3 — — 

eQllaAAH2 

< □i(X)  + [A(X)aAX]  . + c. ---  ♦ c ‘e  using  [A4.6.1] 

2||X||2 

< u. (A)+[A(A)aAX] .+[A(X)-A(X) ] aAX  + ( ay ) 2 ( e./2  ) ck+e  ck 

— l-  - — l — — — Q lui 

< Q . ( X ) -*-aAu  . +e  . ck  ( ay ) + ( ay ) 2cke_/2  +e  c*  using  [D4.6.6] 

— l-  l A l l Q ui 

Ui  ui  Au 

= > "k  1 ^ a~~~  '■2eu+eA(ay)>(eQ/2)(ay)':  using  [A4.6.1] 


* 
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u . Au  . 

l l 

< --  + a 

“ k k 

ci  ci 


2eu  * eA  * V2 


since  a< 1 , y< 1 


u . 


Au, 


so  that 

u!  -i  -i 

[4.6.21]  a + a(1-p)-p  using  [4.6.20] 

k — k k 

c . c . c . 

1 i 1 

which  is  our  desired  error  equation. 


Bounds  on  the  u^ 


First  we  prove  a minor  result: 

[4.6.22]  a<  1 and  u^<c^  =>  u^  + a(  pc^-u^ ) < c|*-a(1-p)c* 

Proof : 


C ] 


a ( c^-u i ) 

< 

k 

crui 

= > 

, k k k. 

a(ci-pci-ui+pci) 

< 

crui 

= > 

a( 1 -p) c^+a( pc^-ui ) 

< 

°i"ui 

II 

V 

ui-a( pc^-ui ) 

< 

k , . 
c^-a( 1 

tk 

'i 


[ ] 


New  consider  two  sets  of  indices: 


J = (i  ! ui<Gi^ 


and 


K = [ i ; c*<u.} 


We  have  Au^pc^-u^  (by  step  MNP) 
From  [4.6.21],  for  iej, 


so  that  aAu. <a( pc  -u . ) . 

l—  i i 


a , k . 

u . u . + a( pc . -u . ; 

1 1 ii  , , 

--  < + a C 1 -p ) - <f> 

_k  ~ k 

c . c . 

l l 


< 1 - (5 


using  [4.6.22] 


Ir 

and  for  i6K,  since  u^c^  , 

..a 


, k k . 

-i  ui+a(pci-ci) 

--  < + a(  1 -p)  - p 

k ~ k 

ci  ci 

ui 

= --  - p . 

k 

ci 


Equa  cion 


for  dk 


Note  that  from  [D4.5.5]  we  have 

ui  u 

--  - 1 < dK 

c . — 

1 

and  from  the  algorithm  we  have 

If  1/ 

£ = (1+d  )c  . 

Consider  the  sets  J,  K of  the  previous  section,  along  with 
their  bounds  on  ua  . 


For 

i6J 

“I 

--  < 

ci 

(1-p)--  = (1-0)(1+dK)  = (1+dK)  - p(1+dx) 

For 

i6K 

c . 

l 

ua 

I 

--  < 

c . 

l 

u.  ck 

1 1 l<  1c 

--  - 0—  < 1 fdK  - 0(  1 +dK ) 

c . ~ 
1 

Gi  ci  “ 

so  that  for  all  i we  have 

a 

ui  k 

--  - 1 < ( 1 -sa)dK  - 0 
c . ~ 


• u . 

And  note  that  d(ua,F)  = , a?X  , | --  -1  j 

luj/ci'  [_ci 

Identifying  _u^  with  u and  u**^  with  _ua  , we  tnus  have 

dx+ 1 < (1-0)dk  - <t> 

U + 1 

as  long  as  the  RH3  is  non-negative.  Clearly,  d =0  will  be 
achieved  in  a finite  number  of  iterations,  at  wnich  point 
the  algorithm  will  stop. 


[ ] 


This  concludes  the  proof  of  Theorem  4-IV. 
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4.7  REVIEW  OF  RESULTS  AND  COMPARISON  WITH  OTHER  WORK 

Starting  with  a simplistic  model,  we  proved  some 
properties  of  the  Selection  Algorithm,  a means  for 
generating  AA  . Using  these  properties  we  were  able  to 
prove  quadratic  convergence  of  an  iteration  scheme  for  a 
restricted  system  configuration.  We  then  generalized  our 
method  of  computing  A_X  , and  proved  quadratic  convergence  of 
an  iteration  scheme  for  a general  system  configuration . 
Next  we  incorporated  the  actual  functions  in  our  model, 
replacing  "local"  assumptions  (continuity)  by  "global" 
assumptions  (system  structure  and  bounds  on  certain  error 
terms  --  it  was  shown  th3t  these  bounds  are  reasonable  for  a 
large  system).  We  were  then  able  to  prove  convergence  of  an 
algorithm  operating  on  these  functions. 

The  use  of  the  "underlying  function"  assumption 
[A4.6.1]  seems  to  be  a powerful  way  to  deal  with  local 
discontinuities.  Katkovnik  [K1]  and  Ermol'ev  [ E 3 ] have 
suggested  using  Stochastic  Approximation  techniques  for 
"smoothing  out"  such  local  discontinuities.  (Conceptually, 
his  method  tries  to  identify  our  underlying  function.)  His 
method  however,  would  involve  a large  number  of  function 
evaluations*  to  compute  each  row  of  A;  our  method  requires 
only  one  new  evaluation  for  each  row  (see  [R4.6.12]). 


•Each  function  evaluation  involves  doing  an  allocation  of 
all  parts. 
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Oar  assumptions,  specially  for  Theorem  4-IV,  may  seem 
strict  and  somewhat  arbitrary.  However,  we  are  placing 
almost  no  restrictions  on  the  individual  Usage  functions 
u1(d1,£1).  If  we  used  Katkovnik's  approach,  along  with  a 
general  framework  for  the  analysis  of  Recursive  Stochastic 
Appr ox imat ion  Algor ithms ,( e .g . Ljung  [L6]  or  Kushner  [K2]) 
we  would  require  strict  conditions  on  the  differentiability 
of  various  functions  and  bounds  on  their  derivatives.  As 
above,  one  may  think  of  our  "global"  assumptions  as 
replacing  such  "local"  assumptions. 

The  final  test  of  the  validity  of  our  assumptions  is 
ofcourse  use  of  the  algorithm  in  actual  cases.  We  have  used 
the  Algorithm  successfully  on  numerous  problems,  both  for 
evaluating  design  of  new  Warehouse  facilities,  and  for 
improving  the  operation  of  existing  facilities.  Examples  of 
this  will  follow  in  the  next  Chapter. 
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CHAPTER  5 

THE  DESIGN  OF  A PRACTICAL  RESOURCE  MANAGEMENT  SYSTEM 

5.1  OVERVIEW  OF  CHAPTER 

The  previous  Chapter  dealt  with  the  "Initial  Allocation 
Problem".  In  this  Chapter  we  illustrate  how,  in  an 
operational  system,  we  implement  the  solution  to  this 
problem,  and  how  our  approach  enables  simple  and  efficient 
solution  of  both  the  "New  Assignment  Problem"  and  the 
"Periodic  Review  Problem"  (see  Chapter  1 for  the  definition 
of  these  problems). 

We  shall  also  suggest  some  additions  to  the  algorithms 
of  Chapter  4,  which  enhance  their  operation  in  practice. 
Most  notable  among  these  is  the  incorporation  of  a "Cost  of 
Reallocation"  for  each  part. 

The  main  features  of  the  computerized  Resource 
Management  System  implemented  in  the  FIAT  Warehouse  are  then 
described,  including  some  details  of  the  program  package  and 
its  performance.  We  conclude  with  a representative  example 
of  the  use  of  the  program  to  solve  a typical 
design-evaluation  problem  for  the  warehouse. 
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5.2  CONCEPTS  OF  THE  PRACTICAL  SOLUTION 
5.2.1  Approach  Used 

The  first  step  in  the  practical  solution  is  deciding  on 
the  calculation  of  the  estimated  resource  usages,  that  is, 
the  functions  u^d1,^1).  For  each  part  i,  these  functions 
should  be  designed  to  estimate  the  current  mean  value  of  u1 
for  this  part.  An  example  of  such  a calculation  is  given  in 
Appendix  5A.  The  estimate  of  total  usage  of  the  rth 
resource  , i .e  . 

ur  = } 
i=  1 

will  then  be  the  sum  of  these  means. 

Rernar  k : Although  the  variation  in  the  individual  part's 
resource  usages  u^  about  their  mean  may  be  quite  high  from 
day  to  day,  the  variation  in  the  sum  u . will  be  much  less  in 
comparison  to  the  value  of  ur  itself.* 

The  In itial  Allocation  Problem  is  solved  with  the  value 
of  each  constraint  cr  set  at  some  proportion  p^  of  the 
actual  physical  capacity  of  that  resource,  for  example  101 
of  crane-time  or  90£  of  storage  capacity.  The  value  of  pr , 

•Formally,  the  standard  error  (=standard  deviation/mean)  of 
the  sum  of  N identically  distributed  independent  random 
variables  is  1/Jn  times  the  standard  error  of  any  one  of  tne 
random  variables. 
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for  each  resource  r,  is  set  by  management  to  reflect  a 
certain  safety  margin.  For  instance,  by  analysing  the 
history  of  demand  on  a particular  resource,  a reasonable 
safety  margin  could  be  set. 


The  New  Par ts  Problem  is  solved  for  each  new  part 


simply  by 

doing 

a Minimum 

Cost 

Allocation  for  that 

par  t , 

using  the 

current 

costs  A. 

In 

other 

words,  we 

solve 

equation 

(MCA)  [4 

.2.3]  for 

that 

part. 

Note  that 

the 

al locat ion 

for  each 

new  part 

can  be 

made 

by  solving 

this 

(relatively)  simple  problem  independently  of  the  rest  of  the 
par  ts  . 

For  the  Per iod ic  Review  Problem , we  note  that: 

1.  The  allocation  of  new  parts  will,  cause  a gradual 
addition  to  the  (average)  usage  of  various 
resources  . 

2.  The  changing  characteristics  of  each  part  will 
cause  changes  (increases  or  decreases)  in  the 
(average)  usage  of  each  resource. 

3.  We  note  that  the  changes  above  are  expected  to  be 

small  with  respect  to  the  constraints.  It  is 

physically  possible  to  continue  operation  due  to 
the  safety  margin  in  each  limit  vaiue. 
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4.  The  net  result  for  each  resource,  will  be  a gradual 
increase  or  decrease  in  the  average  total  usage  of 
that  resource. 

The  aim  of  the  periodic  review  is  to  track  these 
averages,  and  to  keep  them  below  a specified  value  (say 
cr+dr,  where  d^  is  the  maximum  deviation  allowed  for 
resource  r).  This  can  be  done  in  as  sophisticated  a manner 
as  desired.  For  instance,  in  the  Volvera  warehouse  we  found 
that  a monthly  re-allocation  was  sufficient.  In  a more 
rapidly  changing  environment,  a daily,  or  even  hourly 
reallocation  could  be  done. 

The  reallocation  is  done  by  using  the  latest  A,  say 

A°^d,  as  a starting  point  for  iterations  of  the  SALA 

algorithm.  Since  the  data  vectors  id1  for  each  part  have 

i old 

changed,  and  since  new' parts  have  been  added,  lKA  ) which 

was  feasible  earlier,  will  not  necessarily  be  feasible  now. 

However,  it  is  likely  that  a small  change  in  Xolci  (achieved 

in  one  iteration  of  the  SALA  algorithm,  say)  will  make  it 

n 6 w 

feasible.  The  new  value,  say  X , will  now  be  used  for 
allocation  of  each  part,  and  for  the  allocation  of  new 
parts.  This  method  can  be  further  enhanced  as  described 


below . 
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5.2.2  Cost  Of  Re-allocation 


We  now  describe  the  incorporation  of  a feature  called 

the  "cost  of  re-allocation"  for  each  part.  In  the 

n 0 w 

terminology  of  the  previous  section,  for  any  X , we  only 
change  the  allocation  of  a part  if  its  total  (new)  resource 
usage  cost  plus  a certain  re-allocation  cost,  is  less  than 
the  existing  (old)  resource  usage  cost,  that  is: 


Let 


x = arg  min  (MCA)  for  given  X 
—new  — 


new 


x^j  = current  allocation  of  part  i 


u^  = u^Cd1,^  ) [d1  is  the  current  (new)  data] 

1 1 vT  W " ""  I i w W * 


C(x\ 


/ — W w v * I w - UX  a.  wv  u w x VII  li  will  /v  _ T I 

— o la  —new  — o 1 d 

Then  we  reallocate  the  ith  part  from  x1,  , to  x1  only  if 

— n 1 n — n ^ u 3 


-old  = 'i^id) 

1 ) = cost  of  re-allocation  from  x1,  , to  x1 

-new 


-new 


[5.2.  1 ] 


XneW| u1 


,, , 1 .1  > . .new.  l 

-new  + — o 1 d ’ -n  e w - -old 


The  calculation  of  C(.,.)  will  be  discussed  below. 


The  addition  of  this  reallocation  cost  has  many 
advantages  : 


1.  It  makes 

tne  allocation  of 

eacn 

part 

"stable"  , 

that 

is,  for 

smal  1 

changes 

in  X 

, we 

do 

not 

find 

ourselves 

inov  ing 

the  same 

part 

back 

and 

forth 

( or 

else  we  might  spend  all  the  warehouse  resources  in 
constantly  reallocating  parts!) 

2.  It  makes  tne  decision  x_  , for  eacn  new  part,  valid 
in  the  long  term,  and  hence  a good  decision 
according  to  the  criterion  in  Chapter  2,  Section 
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4.3.  The  reason  for  this  is  that  usually  very 
small  changes  in  X are  sufficient  to  keep  the  total 
resource  usages  within  their  margins  as  above. 
Thus,  as  just  described,  the  allocation  of  each  new 
part  is  likely  to  remain  unchanged  for  some  time. 
Only  when  the  characteristics  (d1)  of  this  part 
change  significantly,  or  when  A has  changed 
signifipantly , will  it  be  likely  that  this  part 
will  be  reallocated,  to  reflect  the  new  situation. 

3.  The  reallocation  cost  can  be  incorporated  directly 

in  the  SALA  Algorithm  during  its  iterations,  so 

that  its  decision  jt1  for  each  part,  and  its 

calculation  of  total  resource  usage  u(Xnew) , will 

be  made  using  the  rule  [5.2.1]  above.  Thus  the 
new 

A found  by  the  algorithm  will  take  into  account 

the  reallocation  costs. 

4.  The  cost  calculation  C ( . , . ) can  be  modelled  to 
reflect  the  physical  work  necessary  to  change  from 
one  allocation  to  another.  (This  would  require 
some  "discounting”  calculation,  to  trade-off 
between  a one-time  cost  and  a daily  operating  cost 
over  some  time  horizon.) 

5.  The  calculation  of  C(  . , . ) can  also  be  made  to 
reflect  management  preferences  over  and  above 
physical  work  in  the  above  paragraph.  For  example, 
if  management  is  trying  to  gradually  shut  down  a 


r 
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certain  stor age-type , the  cost  of  moving  out  of  it 
can  be  set  lower  than  the  physical  cost,  whereas 
the  cost  for  moving  into  it  can  be  set  very  high. 

We  note  here  that  the  modification  of  the  SALA 
algorithm  described  in  this  section  is  a heuristical  one, 
whose  consequences  have  not  been  studied  theoretically. 
However,  it  appeals  to  common  sense,  and  has  definitely 
proven  its  worth  in  practice.  The  method  used  for 
calculation  of  C(.,.)  in  the  Volvera  warehouse  will  be 
described  in  sec. 5. 3.7. 


5.3  FEATURES  OF  THE  VOLVERA  IMPLEMENTATION 
5.3.1  Introduction 

The  SALA  program  package,  whose  features  are  described 
in  this  section,  was  designed  and  implemented  by  the  author, 
in  connection  with  the  C.S. Draper  Lab.,  for  use  by  FIAT 
Ricambi  at  their  Central  warehouse  in  Volvera,  Italy.  This 
program  package  provides  management  with  a versatile  tool 
with  which  to  tackle  the  Storage  Allocation  and  Resource 
Management  (SARM)  task  at  Volvera  (see  Chapter  2 for  a 
description  of  this  task).  The  concepts  of  SALA,  from  a 
management  point  of  view,  are  described  in  the  study  [F2], 
and  the  details  of  its  implementation  are  contained  in  the 
SALA  Reference  Manual  [S3].  A brief  description  of  the  SALA 
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program  package  follows  below. 

SALA  is  designed  to  provide  Volvera  with  three  main 
functions  for  the  SARM  task: 

1.  Periodic  Review  and  Reallocation 

2.  New  Parts  Allocation 

3.  Design  Evaluation 

The  first  two  of  these  have  been  described  in  Chapter  2,  and 
their  solution  is  along  the  lines  of  sec. 5. 2.1  above.  The 
third  is  an  important  additional  feature,  described  next. 


5.3*2  SALA  As  A Design  Evaluation  Tool 

The  power  of  the  SALA  package  lies  in  its  flexibility. 
It  has  been  designed  to  enable  management  evaluate  any 
future  warehouse  modifications,  within  the  framework  of  a 
set  of  known  possible  modifications.  (The  modularity  of  the 
package  means  this  set  can  be  expanded  in  the  future,  if 
necessary,  by  an  experienced  FORTRAN  programmer.)  In  effect, 
SALA  provides  the  user  with  the  building  blocks  with  which 
to  define  his  design,  and  also  with  the  tool  that  will  then 
evaluate  this  design.  This  enables  the  study  of  future 
modifications  to  the  warehouse,  such  as  closing  down  an 
existing  storage- type , or  adding  new  storage-types . A 
detailed  illustration  of  this  will  be  given  in  sec.  5.  <J. 
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We  find  that  the  FIAT  management  at  Volvera  are  making 
extensive  use  of  this  feature  for  evaluating  future  designs, 
and  consider  this  aspect  of  SALA  a valuable  decision-making 
aid  . 

5.3.3  Hierarchy  Of  Decision-Making 

We  describe  here  an  important  feature  of  the  SARM 
project.  The  SALA  algorithm  is  designed  to  make  long-term 
decisions  involving  alocations  of  parts  among  different 
stor age-types . Now  these  storage-types  are  themselves 
fairly  large  systems,  and  their  day-to-day  operation  is 
controlled  by  Real-Time  computerized  algorithms.  Other 
portions  of  the  project  have  studied  the  optimization  of 
operations  at  this  level.  Thus  we  have  a natural  hierarchy 
of  decisions.  The  operating  strategies  within  a 
storage-type  give  rise  to  certain  average  measures  of 
performance  in  that  storage-type  (e.g.  crane-time  per 
pick).  These  measures  are  then  U3ed  by  tne  SALA  algorithm 
for  its  decisions  between  storage-types . At  a yet  higher 
level,  the  strategy  of  the  SALA  algorithm  generates  certain 
performance  measures  for  the  Volvera  warehouse  as  a whole, 
and  these  are  used  by  another  project,  the  Inventory  Control 
project,  to  decide  on  the  listribution  of  stock  at  Volvera 
and  at  other  sites  throughout  the  world. 


12 
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5.3.4  Some  Program  Details 

The  3ALA  package  is  written  entirely  in  FORTRAN.*  It  is 
highly  modularized,  for  ease  of  modification,  debugging,  and 
implementation  on  different  computers.**  It  consists  of  some 
40  modules  totalling  some  7000  lines  of  FORTRAN.  Less  than 
30J  of  this  is  the  algorithm  code.  The  remainder  is  for 
several  different  options,  for  generating  various  statistics 
on  each  allocation,  and  for  detailed  diagnostics  if  required 
for  checking  the  operation  of  the  program. 

The  package  is  currently  implemented  on  a Honeywell 
6600  computer.  The  typical  CPU  time  for  an  iteration 
involving  all  60,000  parts  is  about  one  hour.  (An 
iteration,  as  described  in  Chapter  4,  involves  calculation 
of  the  Jacobian,  as  well  as  a step-size  calculation.)  Using 
a sampling  technique,  described  in  the  next  section,  it  is 
possible  to  solve  an  initial  allocation  problem  in  a few 
hours  of  CPU  time.  For  an  incoming  new  part,  a new 
allocation  decision  takes  only  a few  milliseconds,  and  can 
be  done  in  real-time.  A period  ic  review  usually  requires 

•The  output  generated  by  SALA  is  further  analyzed  and 
tabulated  for  warehouse  personnel,  using  COBOL  programs. 

**In  fact,  the  package  was  developed  and  tested  on  a DEC 
PDP-11/70  at  C.S. Draper  Lab.  (Cambridge,  Mass.)  before 
implementation  on  the  Honeywell  H6600  in  Italy. 
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only  one  iteration,  or  one  hour  of  CPU  time. 

This  amount  of  time  should  be  compared  with  the  File 
Allocation  Program  of  Mahmoud  and  Riordan  (see  Table  6-1,  in 
Chapter  6)  which  takes  16  minutes  of  CPU  time  on  an  XDS 
3igma  7,  for  a problem  with  20  files  and  5 nodes.  Our 
solution  is  for  a problem  with  60,000  items  and  16 
storage-types . We  can  also  compare  SALA's  performance  with 
the  previous  techniques  used  at  Volvera.  These  involved 
sorting  all  the  parts  into  various  subsets  according  to 
their  characteristics  (this  took  hours  of  CPU  time),  and 
then  manually  deciding  on  different  subsets  of  parts  to  be 
reallocated,  to  meet  a large  number  of  criteria  (this  took 
months  of  man-time).  In  either  case  SALA's  performance 
compares  very  favorably. 


5.3*5  The  Sampling  Technique 

This  technique  brings  the  algorithm  from  an  arbitrary 
initial  A,  say  A 0 , to  a nearly  optimal  A,  in  an  efficient 
manner  . A random  subset  of  parts  is  selected,  and  all 
constraints  are  reduced  proportionall y . For  example,  if  104 
(i.e.  6000)  parts  were  selected,  the  constraints  would  be 
set  at  104  of  their  actual  values.  The  algorithm  is  then 
made  to  iterate  on  this  sample  until  a solution,  say  A*,  is 
found.  Usually,  5 to  15  iterations  suffice,  and  since  this 
sample  is  a tenth  of  the  whole  sample,  10  iterations  can  be 
dona  in  an  hour.  Now,  because  the  sample  is  a 
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representative  cross-section  of  all  the  parts,  the  value  A* 
is  a very  good  starting  point  for  iterations  on  the  whole 
sample,  and  usually  one  or  two  iterations  on  the  whole 
sample  suffice.  This  means  that  the  entire  problem  can  be 
solved  in  3 to  4 hours  of  CPU  time,  as  mentioned  above.  The 
sampling  technique  is  another  way  in  which  decentral  i zed 
solution  of  the  problem  proves  to  be  advantageous. 


5.3.6  Solution  Of  MCA 

Each  iteration  of  the  SALA  algorithm  involves  solving 
the  Minimum  Cost  Allocation  problem  (MCA)  [4.2.3]  for  each 
part.  The  solution  to  this  problem  can,  in  general,  be 
designed  to  fit  the  needs  of  the  specific  implementation. 
For  che  Volvera  problem,  our  solution  is  described  below.  A 
rather  different  example  of  solving  (MCA)  is  given  in 
Chapter  6 for  the  File  Allocation  Problem. 

In  Chapter  4,  we  noted  that  we  would  restrict  each  x^ 
to  a discrete  strategy  set.  In  the  Volvera  warehouse,  a 
review  of  management  and  operating  requirements,  plus  the 
possible  complexity  of  solving  (MCA)  brought  us  to  the 
decision  that  each  part  would  be  stored  in  one  storage-type 
only.  (Some  exceptions  to  this  are  described  below.)  The 
rationale  for  this  can  be  stated  as:  "If  it  is  possible  to 
find  a feasible  solution  with  this  restriction,  then  we  have 
achieved  our  goal  anyway,  and  with  a simpler  strategy.  If 
we  do  not  achieve  a feasible  solution,  then  we  will  expand 
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our  strategy  set." 

If  we  were  to  use  integer  programming,  this  restriction 
would  still  mean  that  we  have  a 1-million  dimensional  0-1 
integer  programming  problem,  with  the  added  60,000 
constraints 


16 

T~  x1  = 1 for  each  i . 

L 3 

s=1 


However,  for  the  problem  (MCA),  this  restriction  makes  the 
solution  trivial,  and  it  can  be  found  by  a simple  "loop" 
over  the  16  storage-types . Let 

e^  = unit  vector  in  j direction 


and  let 


t = arg 


mm 


[s=  1 , . . . , 1 6]  — * ^ 


X ' u1 ( d1 , Q1eS) 


Then  for  the  ith  part,  the  optimal  allocation  for  a given  X 
is 

Xi(X)  = Qiet  . 

In  other  words,  allocate  all  the  quantity  to  the 
storage-type  where  tne  total  resource  usage  cost  is  a 
m i n i m urn  . 


For  operational  reasons,  it  is  necessary  for  some  parts 
to  be  stored  in  two  storage- types . This  ability  is 
incorporated  in  SALA  by  pooling  certain  resources  of  the  two 
storage-types , and  letting  SALA  find  an  allocation 
satisfying  those  pooled  constraints.  A second  program, 


4 


designed  to  satisy  other  management  r equirements , and  which 
operates  on  a more  frequent  basis,  then  decides  on  the 
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distribution  of  containers  between  the  two  areas.  This 
provides  a convenient  hierarchical  separation  of 
decision-making,  similar  to  that  described  in  sec.5.3-3 
above.  For  details,  see  Suri  C S 4 ] . 


5.3*7  Calculation  Of  Reallocation  Cost 

The  calculation  of  C(  . , . ) is  done  empirically  in  each 

review  period,  by  allowing  warehouse  personnel  to  examine 

detailed  "actual/recommended"  tabulations,  giving  the 

cost- improvement  A'Cu1.  .-u1  ) for  each  part.  (These 

— —old  —new 

tabulations  compare  the  ac  tual  allocation  with  SALA’s 
recommended  re- al locat ion ) . The  tables  are  sorted  into 
subsets  by  current  and  recommended  allocation,  and  each 
subset  ordered  by  cost-improvement.  By  choosing  a threshold 
value  for  cost-improvement  for  each  subset  of  tabulations, 
warehouse  personnel  are  able  to  control  the  number  of 
reallocations  between  any  two  storage- types  in  each  period. 
Their  decisions  are  based  on  their  current  priorities  and 
the  workload  in  tne  warehouse.  The  tabulations  are  also 
useful  when  carrying  out  reallocations:  parts  with  the 

greatest  cost- improvement  are  always  reallocated  first, 
since  tney  represent  the  greatest  mismatch  between  current 
allocation  and  part  char ac ter i st ic s . 


* 
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5.4  EXAMPLE  OF  A DES IGN -E VA LUATION  PROBLEM 

Ae  illustrate  the  use  of  SALA  at  Volvera  with  a typical 
design-evaluation  problem.*  This  involves  selection  of  parts 
to  be  allocated  to  a proposed  new  area,  at  the  same  time 
reallocating  parts  between  existing  areas  to  meet  future 
requirements . 

The  Volvera  management  is  considering  building  a new 
area  at  Volvera,  called  the  MPM  ("Magazzino  Prelievo 
Manuale",  or  manual  picking  warehouse),  with  very  high 
picking  capacity,  but  only  a limited  number  of  parts  can  be 
assigned  there.  The  SALA  program  is  being  used  to  evaluate 
various  design  specifications  for  the  MPM,  by  looking  at  the 
effect  of  certain  decisions  on  the  operating  statistics  of 
tne  whole  warenouse. 


A 

typical 

design 

specification  is 

given  in  Table 

5-1 . 

This 

spec l fication 

is  for  5000 

part-numbers  and 

12 

s torage 

-types . 

There 

are  17  constrain 

ed  resources  (see 

the 

first 

column) 

. For 

example,  the 

entry  "M-11  Part 

No  s" 

refers  to  the  number  of  Fart-numbers  in  storage-type  "M-11". 
This  is  a storage-type  in  the  MPM,  and  it  has  a limit  on  the 
number  of  parts  it  can  handle,  as  mentioned  above.  Some 
resources  are  shared,  for  example  (1+M)-11  means  a resource 


•This  example  uses  a sample  of  500C  parts,  so  as  to  maintain 
confidentiality  of  the  detailed  operating  statistics  of  the 
FIAT  warehouse. 
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used  by  both  storage-type  1-11  and  storage-type  M-1 1 . (The 
.meaning  of  these  codes  is  not  releve.nt  to  this  explanation.) 

The  next  two  columns  in  the  Table,  "Usage  Today"  and 
"Usage  Limit"  reflect  the  current  operating  values  versus 
the  design  specifications  laid  down  by  management.  The 
number  of  Part-Nos  in  M-1 1 , M-20,  M-60  (all  in  the  MPM)  are 
currently  zero,  and  their  design  capacities  are  given  as 
291,  353,  and  20,  respectively.  In  other  resources,  we  see 
that  management  would  like  to  see  some  decreases  in  the 
usages  --  most  notably  in  "1-20  Crane-Minutes"  which  exceed 
tne  desired  value  by  500%.  (The  sign  ">"  is  used  in  the 
Table  to  flag  over-used  resources.) 

The  problem  of  finding  an  allocation  to  meet  the  future 
capacities  was  solved  using  the  SALA  program.  It  was  found 
that  for  the  specified  capacities,  a solution  did  indeed 
exist,  and  the  resource  usages  of  this  solution  are  given  in 
the  fourth  column  of  the  table.  The  main  point  to  rote  is 
that  after  reallocation,  we  find  very  little  spare  capacity 
in  most  resources,  showing  that  the  problem  as  specified  is 
not  an  easy  one  to  solve.  (Resources  whose  future  usage  is 
up  to  their  design  capacity  are  marked  with  "="  or  "-"  in 
the  table.) 

The  costs  A computed  by  SALA  for  each  resource  are 
shown  in  the  final  column.  If  the  MPM  were  ready,  and  parts 
h3d  been  assigned  as  recommended  by  SALA,  then  new  parts 
arriving  at  the  warehouse  would  be  assigned  using  these 
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costs  to  compute  their  best  allocation.  Also,  for  the 
design  problem,  if  management  decided  to  try  a different 
design,  these  costs  would  be  a good  starting  point  for  the 
next  set  of  iterations. 
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Table  5-1 : Design  Evaluation  Example 


RESOURCE 

USAGE 

USAGE 

-■ 

-3 A LA -A  LL0C ATI0N 

NAME 

TODAY 

LIMIT 

USAGE 

COST 

M — 1 1 

Part  Nos 

0 

291 

289 

2798 

M -20 

Part  Nos 

0 

353 

340 

4178 

M -60 

Part  Nos 

0 

20 

20 

17498 

Q-1  1 

Part  Nos 

3^9 

> 

292 

- 

292 

2756 

Q-20 

Part  Nos 

63 

> 

50 

= 

50 

4953 

Q-60 

Part  Nos 

7 

7 

- 

6 

17333 

( 1 +M ) - 1 1 

Shelf  Slots 

2341 

2337 

2334 

255 

( 1 +M)-20 

Shelf  Slots 

3400 

4201 

z 

4200 

1 1 09 

( 1 +M  ) - 6 0 

Shelf  Slots 

2091 

2177 

- 

2171 

3105 

1 -( 1 1 >60 ) 

Crane-mins 

516 

>> 

1 90 

167 

11385 

1 -20 

Cr ane-min  s 

1063 

>> 

175 

155 

16800 

2-1  1 

Shelf  Slots 

2562 

> 

2078 

1 842 

450 

2-12 

Snelf  Slots 

1002 

> 

353 

826 

236 

2-0  1+12) 

Crane-mi  ns 

' 397 

>> 

98 

- 

96 

6001 

3-15 

Shelf  Slots 

2351 

2733 

2564 

79 

3-16 

Shelf  Slots 

2493 

> 

1309 

1017 

1 80 

3-05+16) 

Man-mins 

249 

175 

166 

4500 
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5.5  DISCUSSION 

This  Chapter  extended  the  theoretical  work  of  the 
previous  chapters  to  the  design  of  a practical  system.  We 
illustrated  our  methods  with  details  of  the  system 
implemented  at  the  FIAT  warehouse  in  Volvera.  In  addition 
to  the  operational  aims  of  the  Resource  Management  task, 
this  system  is  being  used  extensively  as  a design  tool,  for 
evaluating  possible  modifications  to  the  warehouse. 
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APPENDIX  5A 

EXAMPLE  OF  CALCULATION  OF  USAGE  FUNCTION 


i 1 1 

This  Appendix  shows  a typical  calculation  of  u (d‘,x  ) 
for  a part,  according  to  tne  approach  of  sec. 5. 2.1.  The 
resource  in  concern  will  be  the  crane-time  for  a part  stored 
in  the  "High  Shelf  Area"  and  picked  at  the  "bays"  (see 
Fig. 2-1  and  accompanying  description). 

Containers  for  such  parts  arrive  at  the  High  Shelf  Area 
via  an  input  ramp.  From  there,  they  are  loaded  into  the 
shelves  by  a crane.  Picks  for  these  parts  are  serviced  by  a 
crane  taking  a container  down  to  a manned  bay,  and  after  the 
pick  is  done,  by  returning  the  container  to  the  shelf.  When 
a container  is  emptied  by  a pick  at  the  bay,  it  is  taken  to 
an  exit  ramp,  instead  of  back  to  the  shelf. 

We  now  demonstrate  the  calculation  of  total  crane-time 
per  day  used  by  such  a part.  We  have  kept  the  calculation 
simple  for  illustration  purposes;  more  sophisticated 
assumptions  and  calculations  could  easily  be  made  in  the 


same  way. 
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The  part-data  d1  required  for  this  calculation  can  be 
summar ized  in  : 

C = no.  of  Containers  emptied  per  day. 

The  warehouse  operating  data  required  is: 

TL  = crane  Time  to  Load  new  containers  into  shelf 

from  input  ramp 

TS  = crane  Time  to  Supply  container  to  bay  from  shelf 

TR  = crane  Time  to  Retrieve  container  from  bay  to  shelf 

TU  = crane  Time  to  Unload  empty  container  from  bay 

to  exit  ramp 

(All  the  above  data  items  are  mean  values.) 

Let  [C]  = integer  part  of  C 

{C } = fractional  part  of  C ( = C — [ C ] ) 

Since  partly  full  containers  are  replaced  on  the  shelf, 
a fractional  value  of  C,  say  0.25,  means  that  (on  average) 
the  container  will  be  bay-picked  and  replaced  on  3 days,  and 
bay-picked  and  emptied  on  the  fourth.  Thus: 

3ay  Picking  Crane  Time/day 

= C»TL  + [C  ] * (TS+TU  ) + {C}*(TS+TU)  ♦ ( 1 - {C  } ) * (TS+TR  ) 

= C*TL  + C*(TS+TU)  + (1-{C})*(TS+TR)  . 

(loads)  (empties)  ( par tl y-ful 1 s) 
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CHAPTER  6 

APPLICATION  TO  THE  FILE  ALLOCATION  PROBLEM 

6.1  INTRODUCTION 

6.1.1  Overview  Of  Chapter 

With  the  growing  size  of  distributed  computer  networks, 
the  problem  of  allocation  of  resources  in  these  networks  is 
becoming  increasingly  important.  This  Chapter  addresses  one 
specific  aspect  of  the  problem:  that  of  File  Allocation. 
When  some  information  (say,  a "file”)  is  shared  by  several 
sites  ("nodes"),  the  question  arises  as  to  where  copies  of 
the  file  should  reside.  The  decision  for  a particular  file 
may  depend  on  its  Query  and  Update  traffic  at  each  node,  on 
the  Storage  costs,  the  Transmission  costs,  on  Response-time 
constraints,  and  Reliability-level  constraints.  Thus  the 
problem  for  a single  file  is  itself  quite  complex. 

Now,  when  several  thousand  files  are  to  be  allocated  in 
a network,  in  addition  to  minimizing  the  above  costs,  the 
network  constraints  (storage  capacities,  link  transmission 
capacities)  have  also  to  be  taken  into  account.  In  this 
case  the  problem  becomes  extremely  complex;  in  fact, 
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solutions  in  the  literature  have  (typically)  addressed 
problems  with  less  than  20  files  (see  sec. 6.  1.4). 

In  this  Chapter  we  show  how,  by  appropriate  formulation 
of  the  problem,  a decentralized  approach  is  possible.  This 
brings  previously  intractable  problems  (with  several 
thousand  files,  say)  within  the  realm  of  known  solution 
methods.  Our  aim  is  to  show  the  applicability  and 
advantages  of  the  decentralized  approach.  For  the  general 
optimization  problem,  we  do  not  go  into  details  of  solution 
algorithms,  but  give  a framework  for  future  research.  We  do 
however,  consider  one  case  in  detail  --  the  problem  faced  by 
a "Network  Manager"  whose  task  is  to  keep  a given  network 
operational  (that  is,  all  resource  usages  within  the 
constraints)  in  the  face  of  constant  arrivals  of  new  files, 
and  changing  character istics  of  old  files. 
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6.1.2  Problem  Statement 

The  general  problem  is  to  allocate  a number  of  files  to 
different  nodes  in  a computer  network,  so  as  to  minimize  the 
total  storage  costs  (of  the  files)  and  communication  costs 
(of  queries  and  updates),  subject  to  various  constraints. 
The  cost  tradeoff  can  be  illustrated  by  a simple  argument 
for  one  file.  At  one  extreme,  if  a copy  is  stored  at  each 
node,  then  queries  at  any  node  can  be  rapidly  answered, 
witnout  any  transmission  costs.  However,  in  this  case  an 
update  at  any  node  has  to  be  transmitted  to  all  the  other 
nodes.  Hence,  the  storage  and  update-transmission  costs 
will  be  high  for  this  case.  At  the  other  extreme,  if  the 
file  is  stored  at  only  one  node,  then  queries  arising  at  all 
other  nodes  need  to  be  transmitted  to  this  node,  and  in  this 
case  the  query  cost  is  high,  but  the  storage  and  update 
costs  are  low.  (The  response  time  for  queries  will  also  be 
large.)  This  argument  is  made  precise  in  the  following 
descr iption . 
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Basic  Parameters  (Given) 


Set  of  Files 
Set  of  Nodes 
Set  of  Links 


f 6 {1 F} 

n 6 1 1 ..... N} 
k 6 { 1 , . . . , K } 


File  Length  Vector 


Unidirectional  links  which 
connect  certain  nodes 
according  to  (given)  topology 

bf  = length  (in  bits)  of 
file  f. 


Query  Traffic  Matrix  Q 


Update  Traffic  Matrix  U 


Node  Reliability  Vector  r 


Qfn  = transmission  traffic 

(bits/sec)  generated  at  node 

n,  for  queries  to  file  f.  \ 

Ufn  = analogous  to  Qfn , but 
for  updates  to  f. 

rn  = reliability  of  node  n 
(probability  that  node  n is 
functioning  at  any  time) 


Link  Reliability  Vector  £ = reliability  of  link  k 

Link  Capacity  Vector  h h'K  = capacity  of  link  k 

(bits/sec) 


(Parameters  relating  to  cost  are  described  under  a separate 
head ing  . ) 
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Decision  Variables  ( A1 1 zero-one  variables) 

These  are  defined  formally  below.  X is  the  decision  as 
to  how  many  copies  of  each  file  are  to  be  scored,  and  where. 
Y is  the  decision  as  to  how  updates  originating  at  a 
particular  node,  and  for  a particular  file,  will  be  routed 
(to  all  copies  of  the  file).  Z is  the  similar  decision  for 
queries  (which  need  only  be  routed  to  one  copy  of  the  file). 


File  Allocation  Matrix  X 


Update  Path  Allocation  Y 
Array 


Query  Path  Allocation  Z 
Ar  ra  y 


Xfn  = 1 if  file  f is 
allocated  to  node  n. 

^fknfi  = 1 updates  for 
file  f,  originating  at 
node  n and  destined  for 
node  fi  , will  use  link  k. 

Zfkr.  = 1 if  queries  for 
file  f,  originating  at 
node  n,  will  use  link  k 
on  their  way  to  some  copy 
of  the  file. 
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Constraints 


File  Availability  Vector  a 

Storage  Capacity  Vector  £ 
Traffic  Intensity  Vector  £ 


af  = minimum  probability 
that  at  least  one  copy  of 
file  f is  accessible  at 
any  time  (depends  on  node 
and  link  reliabilities). 

sn  - on-line  storage 
capacity  (bits)  at  node  n. 

traffic  (bits/sec)  over 
link  k must  not  exceed 
Pk^k  • 


(The  availability  constraint  also  ensures  that  at  least  one  1 

copy  of  each  file  is  allocated.)  The  significance  of  certain 
terms  above  will  be  discussed  further  in  sec. 6.  1.3* 

Costs  and  Objective  Function 


Storage  Cost 

Vec  tor 

cS 

Cp  = cost  of  storing  one 
for  one  second  at  node  n. 

Tr ansm  ission 
Vector 

Co  st 

T 

c 

T 

c£  = cost  of 
one  bit  over 

tr ansm  itting 
link  k . 

The  objective  is  to  find  t X , Y , Z } to  minimize: 

Total  Storage  Costs  (function  of  X) 

+•  Total  Transmission  Costs  (function  of  X,  Y,  Z) 

subject  to  the  constraints  on: 

File  Availability  (function  of  X) 

Storage  Capacity  (function  of  X) 

Traffic  Intensity  (function  of  X,  Y,  Z) 


A mathematical  formulation  of  the  above  will  be  given  later. 
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6.1.3  Comments  And  Comparison  With  Other  Models 

In  the  literature  on  the  File  Allocation  Problem, 
although  the  flavour  of  the  problem  is  the  same,  when  we 
investigate  details  we  find  no  generally  accepted  model. 
Retrieval  time,  which  is  given  so  much  emphasis  in  [C2]  and 
[Ml]  is  completely  missing  in  C C 3 ] and  [L7J.  Similarly, 
File  Availability  is  considered  only  in  [Ml],  and  Storage 
Capacity  appears  only  in  [C2].  Thus  there  appears  to  be  a 
wide  range  of  acceptable  models  in  this  field.  In 
comparison  with  other  work,  our  model  has  the  following 
features  : 

1.  File  Availabil ity . This  rather  complex  constraint, 
which  leads  to  highly  nonlinear  constraint 
equations  (see  analysis  in  [Ml]),  is  retained  in 
our  model,  and  we  shall  see  that  it  is  easily 
incorporated  in  our  analysis. 

2.  Lin  k Capacity . In  some  formulations  [Ml]  choice  of 
link  capacity  is  also  included  as  a decision 
variable  in  the  model.  We  assume  an  existing 
network  with  given  capacities. 

3.  Retr iev al  Time . Instead  of  using  this  constraint 
directly,  as  in  [C2]  or  [Ml],  we  have  a traffic 
intensity  (or  congestion)  constraint  for  each  link. 
The  justification  for  this  is  that  tne  average 
retrieval  times  between  nodes  are  a function  of  the 
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traffic  intensities  (see  [Ml]),  and  by  specifying 
the  latter,  we  can  calculate  upper  bounds  for  the 
retrieval  times. 

4.  Decisions.  We  are  able  to  incorporate  even  the 
(very  high-dimensional)  arrays  Y and  Z in  our 
formulation,  and  to  solve  for  them  quite  easily. 

5.  Program  Allocation . In  some  formulations  (e.g. 
[ L 7 3 ) the  file-allocation  problem  is  considered 
simultaneously  with  the  program  allocation  problem. 
We  do  not  consider  program  allocation  at  all. 

The  approach  taken  by  Chang  [Cl]  is  of  a different 
nature.  He  considers  an  extremely  general  model  of  the 
Distributed  Computer  System  Design  Problem,  with  a very 
large  number  of  parameters.  His  goal,  however,  is  to  find  a 
system  configuration  satisf  ying  various  performance 
requirements  and  design  constraints.  The  criterion  of 
optimality  is  thus  replaced  by  user  satisfaction  with  a 
design . 

A major  criticism  of  all  the  existing  models  has  been 
given  by  Rothnie  and  Goodman  [R4].  They  state  that  these 
models  neglect  three  important  factors: 

1.  The  models  do  not  adequately  reflect  user  demand 
for  database  access  involving  more  than  one  file  at 


the  same  time. 
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2.  The  models  completely  neglect  the  synchronization 
costs  involved  in  updating  redundantly  stored  data. 
This  cost  varies  in  a complex  way  with  th.e  other 
design  decisions. 


3.  The  models  assume  that  complete  files  should  be  the 
unit  of  assignment  to  nodes.  (There  are  situations 
in  which  an  optimal  partition  of  a file  will  give 
better  results.) 


Their  criticism  is  well-taken,  and  our  model  also  suffers 
from  these  shortcomings.  However,  we  feel  that  even  the 
existing  models  have  not  yet  been  solved  efficiently.  Our 
efforts  are  a step  in  the  direction  of  solving  such  models, 
and  only  after  this  can  we  hope  to  solve  the  even  more 
complex  ones  obtained  by  including  the  suggested  features. 


6.1.4  Comments  On  Existing  Solution  Methods 

A cross-section  of  recent  work  in  this  area  is 
represented  by  [C2],  C C 3 ] , [L7],  [Ml],  C W 1 ] . A review  can 
be  found  in  [L7].  The  solution  methods  vary  from  integer 
programming  [C2]  and  linear  programming  [W1],  to  a 
heuristical  technique  combining  dynamic  programming  and 
search  methods  [Ml].  All  methods  suffer  from  intractability 
as  soon  as  the  problem  becomes  large.  Typical  problems 
solved  are  given  in  Table  6-1,  where  we  see  that  a problem 
with  5 files  and  20  nodes  would  be  of  the  order  of  the 
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largest  solved,  and  would  take  16  minutes  of  CPU  time.  The 
area  where  we  intend  to  research  this  problem  --  1,000  to 
100,000  files  and  10  to  20  nodes  --  is  clearly  out  of  range 
of  these  methods. 

Another  shortcoming  of  the  solution  methods  relates  to 
a problem  encountered  in  everyday  systems:  how  to  allocate 
a new  file  that  has  appeared  in  the  system?  The  methods 
offer  no  simple  way  of  making  this  decision,  short  of 
re-solving  the  whole  problem  for  the  new  set  of  files. 
Again,  this  is  an  area  where  we  can  make  a contribution. 

A useful  method  from  our  point  of  view,  and  one  we 
shall  return  to  in  sec. 6. 2. 3,  is  that  of  Casey  [ C 3 J - He 
solves  the  allocation  problem  for  copies  of  one  file  only , 
with  storage  and  cornmun ication  costs,  but  no  constraints. 
Since  we  shall  refer  to  his  results,  we  summarize  the 
problem  solved  by  Casey  in  the  next  section. 
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Table  5—1 : Typical  File-Allocation  Problems  Solved 


Ref  crence 


Problem  Size  CPU  Time  Computer  Used 


[ C 2 ] Chu  5 Files  25  sec  IBM  360/65 

3 Nodes 
Only  1 copy 
of  each  file 


C C 3 3 Casey  1 File 

19  Nodes 


2 sec  IBM  360/91 


[Ml]  Mahmoud 
& Riordan 


95  (Files  x 

Nodes) 

170  (Links  x 
Capacities) 


16  min  XDS  Sigma  7 


Notes  : 

1.  It  is  not  possible  to  make  an  even  comparison, 

since  the  assumptions  used  differ  so  much  in  each 
case  above.  For  example,  [C2]  and  [Ml]  used 

complex  constraints,  while  [C3]  had  no  constraints. 
[C2]  did  not  allow  copies  of  files,  while  the 
others  did.  [Ml]  included  decisions  on 

link-capacities.  Thus  the  above  Table  should  not 
be  used  to  compare  the  different  solutions,  but 
only  to  get  an  idea  of  tne  computer  times  involved. 

2.  Casey's  numbers  were  for  a total  of  6 runs,  so  our 
figure  above  is  an  average  of  those  runs.  Also, 
Casey  solved  for  both  a "linear"  and  a "nonlinear" 
relaying  assumption  for  updates;  the  figure  above 
is  for  the  linear  case. 
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6.1.5  Casey's  "One-File"  Problem 

To  avoid  duplication  of  notation,  we  shall  describe 

Casey's  problem  for  one  file  using  our  notation  of 

sec. 6. 1.2,  but  with  the  value  of  the  subscript  f fixed  , that 

is,  a specified  file.  The  given  parameters  are  Query  and 

Update  traffic  (Qfn  and  Ufn , for  each  n),  file  length  (bf), 

S 

and  storage  costs  (c  ).  Instead  of  deciding  on  query  paths 
and  update  paths  as  above,  Casey  assumes  simply  tnat  a cost 
of  unit  transmission  from  node  n to  node  PI  is  given  (=d 

nn 

and  tnere  are  no  link  capacity  constraints.  The  decision  to 
be  made  is  only  the  file  allocation  (Xfn,  for  each  n),  that 
is,  at  which  node  should  copies  of  f reside  ? For  a given 

f 

allocation,  say  Xfn=1  for  n 6 N , the  total  cost  incurred  is 

f 

a function  of  N : 

r N N , 

C6.1.U  C(Nr)=r  [“fn  rXffldnf1+  JJeNf  dnfl+  bfcn*fnJ 

n=  1 PI  = 1 

This  can  be  seen  as  follows  --  for  a given  node,  say  n,  all 
updates  arriving  at  n must  be  transmitted  to  all  copies  of  n 
(see  first  term  above),  queries  will  be  answered  from  the 
"cheapest"  node  (second  term),  and  there  will  be  a storage 

f 

cost  (third  term).  C(N  ) is  now  obtained  by  summing  over  n. 

Casey’s  so  1 ut ion  method  is  best  visualized  using  the 
hypercube  of  all  possible  allocations  for  file  f.  Fig. 6-1 
illustrates  this  for  N=4  nodes.  Each  vertex  of  the 
hypercube  represents  a value  of  the  N-dimensional  binary 
vector  (xf  1 , Xf 2, . . . , Xf ^) . Each  downward  edge  joining  two 


L 
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vertices  represents  an  addition  of  exactly  one  copy  of  the 
file.  Let  us  consider  all  possible  downward  path3  , from 
(0,0,... ,0)  to  (1,1,... ,1).  A theorem  by  Casey  states  that 
along  any  such  patn,  wnenever  a cost  increase  is  encountered 
in  a step  from  a vertex  V to  tne  next  vertex,  that  path  can 
be  abandoned,  since  it  will  not  lead  to  any  points  lower  in 
cost  than  V.  Using  this  property,  a "path  tracing" 
algorithm  of  Casey  can  be  used  to  find  the  optimal 
allocation,  without  computing  the  cost  of  every  vertex. 


Fig . 6- 1 : Hypercube  of  Allocations  for  4 Nodes 
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6.2  ANALYSIS  OF  MODEL 

6.2.1  Mathematical  Formulation  Of  Problem 


[6.2.  1 ] Transmission  Costs 


Let  TCf(X,Y,Z)  = Total  Transmission  costs  for  file  f 

when  decisions  X,Y,Z  are  taken 

N ( Cost  of  Updates  Cost  of  Queries  ; 

= J~  < from  n to  all  + from  n to  the  > 
n~  I copies  of  f specified  copy  > 

x of  f 

N { N K _ K \ 

= y_<  ufn  T_  (XfrT  T^CKYfknfi )+  Qfn  J^kZfkn  > 
n=  1 \ fl  = 1 k=  1 k=  1 ) 


[6.2.2]  Traffic  over  Links 


Let  Tf  (X,Y,Z)  = Total  Traffic  over  link  k due  to 

file  f when  decisions  X,Y,Z  are  taken 


( Traffic  over  k 


= r<: 

n=  1 


V 

/ 


Traffic  over  k \ 
due  to  Updates  + due  to  Queries  j 
from  n to  all  from  n for  f > 

copies  of  f I 

) 


N ; N ; 

= Ufn  T-  XfflYfknh  ♦ 3fnZfKn  > 

n=  1 ^ flrl  ) 

[6.2.3]  Remark:  Note  that  re-arrangement  of  the  equation 
for  transmission  costs  [6.2.1]  gives 


K T 

[ 6 . 2.  ^ ] TCf ( X , Y , Z ) = Y CkTfk(X,Y,Z) 

kTl 


as  is  to  be  expected,  since  the  total  transmission  costs  can 
be  thought  of  as  arising  in  either  of  these  two  ways. 
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Lb. 2. 5]  ProDlem  Statement 


Define  the  set  W of  admissable  decisions  to  consist  of 
decisions  (X,Y,Z)  which  satisfy  the  following: 


X is  such  that  each  file  f satisfies  its  availability 

constraint . 

Y is  such  that  for  each  f,  the  set  of  links 

{k  > ^fknf5=1J  form  a path  from  node  n to  Pi. 

Z is  such  that  for  each  f,  the  set  of  links  {k  ! Zf^nzl) 

form  a path  from  node  n to  some  node  Pi  with  Xf^=1. 

< 

Tne  problem  can  then  be  stated  as 

F N F 

[6.2.6]  min  V~  7~  CnbfXfn  + TCf(X,Y,Z) 

( x , y , z ) ew  f— | n— i f— i 

(storage  costs)  + (transmission  costs) 

subject  to 
F 

J~  bfXfn  £ sn  ^or  each  n (storage  capacity) 

fTj 


p 

^ Tfk  (X,Y,Z)  £ P[<h^  for  each  k (link  capacity) 

fTl 


6.2.2  Lagrange  Multiplier  Approach 


For  the  problem  above,  we  use  conventional  optimization 
theory  [BID,  [L5],  [LI],  to  formulate  the  Lagrangian.  Let 
ii  = Multiplier  for  Traffic  Constraints 
2 = Multiplier  for  Storage  Constraints 
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The  Lagrangian  L is  then  given  by 
F N F 

L = r rc^fxfn + rTcf(x-Y>z) 


f=1  r.=  1 f-1 

N 

F 

♦ r *n 

( rbfXfn  ~ sn  ) 

n=1 

f=1 

K 

F 

♦ ak 

( y Tfk(X,Y,Z)  - pkhk) 

k=  1 

f=1 

Using  [6.2.4]  and  also  defining  the  modified  "costs" 

T 

H = H + £ 

. _ S 
v = v + c 

we  rearrange  L to  get 

F [ N K Xj 

L(X,Y,Z,u,v)  = r<,bfrvnxfn  + rul<Tf  k ( X , Y , Z ) > 
f=1  \ n= 1 k = 1 } 

N K 

- "J~ vnsn  - J~ ukpkhk  + constants 
n~1  k~i 

The  saddle-point  theorem  [LI]  states  that  a solution  to 
the  original  problem  [6.2.6]  will  be  obtained  if  we  can  find 
( X * , Y* , Z*  , u*  , v* ) which  simultaneously  satisfy: 


[6.2.7] 

(X* , Y* , Z« ) 

= arg 

min  L(X , Y, Z , u*  , v* 

(X,Y,Z)ew 

[6.2.8a] 

u* 

= arg 

max  tL(X*,Y«,Z*  ,u  ,v«) 

u > c 1 

[6.2. 8b] 

l< 

* 

= arg 

max  SL(X«,Y*,Z*  ,u*  ,v) 

111 

where  the  vector  inequalities  are  to  be  interpreted 


componentwise . 
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6.2.3  Decentralized  Solution  Technique 

This  approach  has  already  been  described  in  Chapters  2 
and  3,  and  is  suggested  by  the  statements  [6.2.7]  and 

[6.2.8]  above.  The  decentralized  solution  algorithm  starts 
with  initial  guesses  for  u and  v^  and  solves  [6.2.7]  to  get 
\ and  say.  It  then  uses  these  values  of  X, 

Y,  Z,  to  modify  u and  v slightly,  so  that  improvements  in 
the  objectives  in  [6.2.8]  are  obtained . This  process  is 
repeated  until  equilibrium  is  obtained,  at  which  point  the 
values  of  X,  Y,  Z,  will  be  solutions  to  the  optimization 
problem  [6.2.6].  Details  of  this  approach,  as  well  as 

conditions  for  its  convergence,  can  be  found  in  [ A 1 ] . Our 

aim  here  is  to  examine  its  applicability,  which  will  be 

brought  out  in  the  Theorem  below. 


[6.2.9]  Theorem  6-1  (Decomposition  of  File-Allocation 
Problem ) 


The  optimization  problem  (for  given  u,  v.) : 

[6.2.10]  Find  (X«,Y«,Z*)  = arg  min  L (X , Y, Z , u , v ) 

( x , y , z ) ew 

is  equivalent  to  a set  of  F ind  epend  ent  individual 
file-allocation  problems,  and  each  of  these  individual 
problems  is  equivalent  to  the  problem  solved  by  Casey 
( sec  .6.1.5).  [ ] 
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Pr oof : We  will  omit  terms  not  containing  X,  Y,  or  Z.  Also, 
for  notational  convenience  we  define  the  following  vectors: 


1 

vector 

r 

X 

such 

tha  t 

f 

xn  = 

Xf  n 

[6.2.11] 

vector 

fnfl 

1 

such 

that 

f nf! 
yk  = 

Yfknfi 

• 

vector 

fn 

z 

such 

that 

f n 

2k  = 

2f  kn 

• 

In  the  above,  if  we  restrict  (X,Y,Z)  to  range  over  W, 
the  values  of  x ^ -*'n 


then 


y , z'  '•  , generate  the  corresponding  sets 

of  admissable  decision  vectors.  These  sets  will  be  denoted 

by  X^ , Y^nn,  Z^n  respectively.  Also  define 

dnj1  = cost  of  minimum-cost  path  from  node  n to  h (for 
given  transmission  cost  vector  u)  . 

The  problem  is 

/ 


F [ K N \ 

J~  <,  Y^_uk  ^ ^ ) + bf^vn*fn  ? 

f=1  \ k = 1 n= 1 ) 


min 

( x , y , z ) ew 

which,  using  [ 6 . 2.  1 ]- [6 . 2. 4 ] becomes 


N 


/ 


N 


K 


\ 


m^-n  r < ufn  Txffi  rukYfknn+Qfn  T uk^f kn  + bfvnXf n> 

(X,Y,Z)6W  f:in:l\  0=1  k=  1 k=  1 j 

Mow,  by  consiaering  the  independence  of  certain  decisions 

(since  there  are  no  constraints  for  this  problem),  we  can 

interchange  some  of  the  "min"  and  " operations,  to  get: 


F 

r 

f= i - 


N 


t*?eSf  £<><■"  r4  /"*«*>»  < > 

n=  1 ^ 0 = 1 


K 

r 

k=  1 


+ 2f^ezfn  (£ukZkn)  + bfvn%  > 

k=1  } 
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A 


Defining  the  set  of  nodes  N*  = [n  ! 

definition  of  dnj,  above,  tnis  equals 

/ 


N 


mxn  „ 

fTl  ~ nTi  ^ 


N 


and  using  the 


\ 


r r<>f-r^nn 


r\  m i n « , 

+ Qfn  he.Nf  d 


fl=  1 


nfl 


+ brVn 


which,  for  each  file  f,  is  ex ac tl y the  file  allocation 

problem  solved  by  Casey  [6.1.1].  (The  availability 
f f 

constraint  x ex  can  be  added  on  with  ease.  We  simply 
constrain  Casey's  algorithm  to  start  from  the  set  of  highest 
vertices  in  the  hypercube,  which  satisfy  the  availability 

f 

cr  terion,  i.e.  are  members  of  X . For  a calculation  of 
availability,  see  [Ml].  Note  that  the  above  set  of  vertices 
need  only  be  calculated  once,  for  each  file.) 


Thus  the  optimal  decisions  are  given  by: 


[6.2. 12] 


if  ■ -g  Urnf*^ 


N 


/ 


N 


f nfl 


n=  1 V PJ=1 
min 


nfl 


\ 
f ! 


Qfn  dnp5  - bfvnxn  > 

/ 


minr  _ *r — fnfl 

ar&  yfnfS0y^n^  /_ukyk 

L k=  1 


. f n min_  r — I 

1 = arg  Zfnezfn  J_u'<zk 

k=  1 

f 

Note  that  £ can  be  calculated  using  Casey's  algorithm, 
while  y^nrt  and  z^n  are  simply  the  minimum  cost  paths  between 


min, 


fn 


certain  nodes,  as  calculated  for  dn(1  above.  [] 


1 
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[6.2. 133  Inter pretation  of  Theorem 

We  have  thus  derived  rigorously  the  appealing  result  that 
the  problem  [6.2.10]  con  be  approached  individually  for  each 
file.  Note  that  the  actual  costs  are  replaced  by  the 
artificial  costs  (_u,v_)  when  we  solve  the  individual  file 
allocation  problem.  Conceptually,  we  have  created  a "File 
Manager"  for  each  file,  whose  job  is  to  allocate  his  file  in 
tne  cheapest  manner,  for  given  artificial  costs  u and 
v.  [] 

[6.2.14]  Remark : Morgan  and  Levin  [M2]  showed  that  the 
multifile  minimization  problem  could  be  decomposed  into 
individual  file  minimization  problems.  Their  proof  however, 
was  in  the  absence  of  any  link  capacity  or  storage  capacity 
constraints.  Our  result  is  therefore  much  more 
signif icant . [ ] 


6.2.4  Conjectured  Iteration  Algorithm 

Analogous  to  the  File  Managers  solving  [6.2.10]  for 
each  file,  we  can  think  of  a Network  Manager  who  is  in 
charge  of  satisfying  the  network  constraints,  and  who 
changes  the  artificial  costs  by  small  amounts  so  as  to  get 
an  improvement  in  the  objective  functions  of  [6.2.8]. 
Inspection  of  these  objectives  shows  that  (in  accordance 
with  our  intuition)  if  the  usage  of  a resource  is  over  its 
limits,  its  artificial  cost  will  be  increased  by  tne  Network 
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Manager  , and  vice-versa,  except  that  the  artificial  cost 
will  not  be  lowered  below  the  actual  cost. 

[5.2.  15]  Algor i thm  (Simple  Version ) 

The  simplest  decentralized  solution  algorithm  would  thus 
consist  of  a FM  ( File-Manager)  Subroutine,  which  would  solve 
[6.2.7]  for  each  file,  followed  by  a NM  (Net work -Manager) 
Subroutine,  which  would  change  the  resource  usages  as  above. 
The  algorithm  would  iterate  as  described  in  sec. 6. 2. 3- 


6.3  ALGORITHM  FOR  THE  FEASIBILITY  PROBLEM 

The  iteration  method  above  is  ratner  crude,  and  may 
converge  extremely  slowly,  or  not  at  all.  In  this  section, 
we  describe  a special  case  of  the  File  Allocation  problem, 
for  which  an  efficient  algorithm  exists,  and  for  which  the 
conditions  for  convergence  are,  by  comparison,  considerably 
less  stringent. 

Suppose  the  network  has  a very  large  number  of  files 
(say,  10,000)  and  is  managed  by  a network  manager  whose  task 
is  to  keep  the  network  operational,  i.e.  all  the  resource 
usages  within  the  constraints.  Suppose  also  that  he  must 
find  allocations  (in  real-time)  for  several,  newly  created 
files  every  day.  In  addition  he  must  periodically 
re-allocate  existing  files  according  to  their  changing 
characteristics,  and  according  to  the  changes  in  total 


resource  usages.  His  tasks  can  thus  be  summarized  as: 

1.  Find  an  initial  feasible  solution. 

2.  Find  a rationale  for  allocating  new  files  in 
real-time  (hence  must  be  a simple  algorithm). 

3.  Maintain  the  resource  usages  below  their 
constraints  by  periodic  re-allocation. 

The  main  problem  above  is  to  find  a feasible  solution  , that 
is,  a set  of  allocations  and  query/update  path  decisions, 
which  satisfy  all  the  storage  constraints,  link  capacity 
constraints,  and  file  availability  constraints.  For  a large 
number  of  files,  this  in  itself  could  be  a major  problem. 
An  economic  inter pretation  of  this  problem  would  be  as 
follows:  Suppose  a network  has  already  been  constructed 

witn  certain  storage  and  link  capacities,  and  the  additional 
cost  of  using  these  facilities  is  negligible  in  comparison 
with  the  original  investment.  Then  the  main  concern  of  the 
Network  Manager  is  one  of  feasibility.  A design-time 
interpretation  of  this  approach  would  be  to  say  it  is  of  the 
type  advocated  by  Chang  [Cl],  as  described  in  sec. 6.  1.3. 

The  feasibility  problem  has  already  been  studied  by  us 
in  Chapters  2 to  4.  Our  results  are  applied  below  to  the 
file  allocation  problem.  We  only  outline  the  application. 
The  details  are  an  easy  extension  of  the  work  in  previous 
Chapters  . 


/J 
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6.3*1  Application  Of  Our  Results 

[6.3.  1 ] 3tdte.nen  t 0 f the  Feasibility  Problem 
Find  (X,Y,Z)  such  that 

( X , Y , Z ) G W (includes  availability  constraint) 

F 

)~bfxfn  £ sn  for*  each  n (storage  capacity) 
fTi 

F 

J~ Tf.k(X,  Y,Z)  < Pi<bk  f'or  each  k (link  capacity) 
fTi 

[5 . 3.2]  Statement  of  the  Lagrangian  Problem 

Let  u >0  and  _>0  be  given  , and  let  X = (u’  , v_*  ) ’ . Let  , 
f n T f n 

2 , z , be  as  in  [6.2.12].  The  Lagrangian  Problem  is  to 

find  the  "decentralized  solutions"  X(X),  Y(X),  Z(X)  where: 
X(\)  is  such  that  Xfn  = 

Y(^)  is  such  that  Yfknfi  s y£n?I 

Z(X)  is  such  that  Zfkn  = 

[6.3.33  Ex  istence  Theorem 

(For  a formal  statement  see  Chapter  3).  This  theorem  says 
that  if  the  problem  [6. 3.1]  is  still  feasible  for  a si ightly 
tighter  set  of  storage  and  link  capacity  constraints,  then 
there  will  exist  multipliers  X*  such  that  the  decentralized 
solutions  X(^*),  Y(X*),  Z(X*),  will  be  feasible  for  the 
problem  [6.3*1]-  [ ] 
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[6 . 3. 4 ] Remar ks 

1.  A major  drawback  to  the  use  of  decentralized 
solutions  in  general,  is  the  existence  of  problems 
for  which  no  decentralized  solutions  exist  --  see 
the  remarks  on  "duality  gaps"  in  Chapter  2,  Sec. 7. 
Our  theorem  above  gives  conditions  which  ensure  the 
existence  of  a suitable  A*,  and  also,  the  remarks 
in  Chapter  3 show  that  these  conditions  are  likely 
to  be  true  for  a large  system. 

2.  As  pointed  out  in  Theorem  6-1,  for  given  A,  the 
decentralized  solutions  are  relatively  easy  to 
find.  The  problem  [6.3.1]  is  thus  reduced  to  one 
of  finding  an  appropriate  A*.  This  will  be  our 
next  topic  of  discussion. 

6.3.2  Finding  Suitable  Multipliers 

In  Chapter  4 we  gave  algorithms  for  finding  multipliers 
such  that  the  decentralized  solutions  solved  the  feasibility 
problem.  Consider  the  SALA-3  iteration  scheme  [4.6.11]. 
The  reader  can  see  that  a simple  modification  of  this  scheme 
(replace  the  step  " INITMCA"  by  a solution  of  [6.2.12],  and 
define  the  resource  usage  vector  to  be  the  storage  and  link 
capacity  usages  for  a given  solution)  gives  us  an  algorithm 
for  the  file  allocation  problem.  Theorem  4-IV  in  Chapter  4 




> 
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snows  that,  under  suitable  assumptions,  such  an  iteration 
scheme  converges  to  a solution  in  a finite  number  of 
iterations,  at  a better  than  geometric  convergence  rate. 
Discussions  in  Chapter  M also  demonstrate  that  the 
assumptions  are  reasonable  for  a large  system,  and  this  has 
been  upheld  by  use  of  the  algorithm  in  practice  (Chapter  5). 


6.3.3  Conjectured  Efficiency  Of  Iteration  Methods 

Experiments  with  the  SALA  algorithm  (see  Chapter  5) 
have  shown  that  it  converges  in  5 to  15  iterations, 
irrespective  of  the  number  of  items  (in  this  case,  files). 
The  solution  time  on  average  is  (say)  10  iteration  cycles. 
The  time  for  each  iteration  is  proportional  to 

(no.  of  files)  x (time  to  solve  one-file  problem)  . 

Now  Casey's  algorithm  for  solving  the  one-file  problem  has 
been  shown  to  be  NP-complete  [E2],  as  a function  of  the 
number  of  nodes  (N).  However,  for  a small  number  of  nodes 
(under  20),  the  time  to  solve  the  one-file  problem  is  still 
reasonable  (2  CPU-secs  for  19  nodes,  see  Table  6-1).  The 
solution  time  for  our  method  is  therefore  expected  to 
increase  linearly  with  the  number  of  files  and  exponentially 
with  the  number  of  nodes.  The  time  for  Integer  Programming 
solutions  would  be  exponentially  proportional  to  each  of 
these.  Thus  our  approach  is  expected  to  be  superior  for 
problems  witn  a very  large  number  of  files  (1000  to  100,000) 
and  a moderate  number  of  nodes  (10  to  20). 
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However,  note  that  since  the  changes  in  X at  each 
iteration  are  small,  it  may  be  possible  to  re-solve  the 
one-file  problems  in  a more  efficient  manner,  using  the 
results  of  the  previous  iteration.  This  would  extend 
applicability  to  problems  with  a larger  number  of  nodes,  and 
is  a subject  for  further  research. 


6.3-^  New  Files  And  Periodic  Re-allocation 

The  previous  sections  considered  the  (static)  file 
allocation  problem.  In  an  operational  network,  we  have  the 
two  additional  problems,  "new  file  allocation"  and  "periodic 
reallocation"  (sec. 6. 3.1).  The  solution  of  these  problems 
in  an  operational  system  has  been  discussed  in  Chapter  5. 
The  reader  can  see  that  this  approach  can  be  used  for  the 
file  allocation  problem.  Briefly,  the  new  file  allocation 
problem  is  solved  by  using  the  current  "costs"  X to  solve 
the  one-file  problem  for  a new  file  (can  be  done  in 
Real-Time),  and  the  periodic  review  problem  is  solved  by 
finding  a small  change  in  X such  that  the  resulting 
re-allocation  will  restore  usages  to  their  desired  levels. 
See  Chapter  5 for  details. 


6.4  SUMMARY 


We  have  demonstrated  the  scope  for  application  of 
decentralized  solution  techniques  to  the  file  allocation 


problem.  Our  formulation  allows  the  incorporation  of 
several  different  types  of  constraints,  including  the  rather 
complex  file  availability  constraints.  We  outlined  a 
possible  solution  algorithm  for  the  general  problem.  In  a 
special  case  of  this  problem,  we  showed  how  results  from 
Chapters  3 to  5 could  be  used  directly  for  solution  of  the 
problem.  Our  methods  are  efficiently  applicable  for  systems 
with  a very  large  number  of  files,  and  a moderate  number  of 
nodes.  Further  research  may  extend  applicability  to 
problems  with  a large  number  of  nodes  too.  The  size  of 
problems  that  we  can  solve  efficiently  (10,000  files  and  10 
nodes,  say)  would  certainly  be  considered  intractable  for 
other  existing  solution  techniques. 
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CHAPTER  7 

CONCLUSIONS:  A NEW  APPROACH  TO  LARGE  SCALE  SYSTEMS? 

We  have  studied  the  task  of  Resource  Management  in 
Large  Systems.  This  task  was  defined  as  having  three  main  1 

objectives:  Initial-Assignments,  New-Assignments , and 

Periodic  Review.  The  importance  of  this  task  was 
illustrated  by  examples  from  a large  warehouse,  and  from  a 
computer  network. 

The  Resource  Management  task  was  formulated  as  a 
certain  "Feasibility  Problem".  The  decentralized  approach 
was  made  possible  by  formulating  an  equivalent  "Artificial" 
optimization  problem,  and  decomposing  it  through  the  use  of 
Lagrange  Multipliers.  Chapter  3 studied  the  ex istence  of 
optimal  multipliers  for  this  artificial  problem,  while 
Chapter  4 developed  iteration  algorithms  to  find  these 
multipliers.  Chapter  5 demonstrated  the  application  of  our 
techniques  in  a practical  system,  and  Chapter  6 illustrated 
the  applicability  of  our  methods  to  the  file  allocation 
problem  in  computer  networks. 

i 

«*. 


I 


-183- 


The  advantage  of  decentralized  techniques  is  that  they 
make  possible  the  efficient  solution  of  very  large  problems. 
However,  the  applicability  of  these  techniques  has  been 
restricted  to  problems  which  satisfy  strict  conditions.  The 
main  contribution  of  our  work  is  to  extend  the  applicability 
of  decentralized  solution  methods  to  problems  where  the 
resource  usage  functions  are  not  well-behaved.  We  give 
conditions  for  the  existence  of  decentralized  solutions,  and 
also  give  algorithms  to  find  such  solutions,  without 
requiring  strict  conditions  on  the  functions. 

The  type  of  assumptions  and  conditions  required  for  our 
results  in  Chapters  3 and  *4 , reflect  properties  of  the 
system  as  a whole,  rather  than  the  properties  of  the 
individual  items  in  the  system.  We  feel  that  this  is  an 
important  viewpoint  for  dealing  with  large  systems: 

Results  in  many  areas  of  Large-Scale  Systems  theory 
depend  on  strict  conditions  on  the  functions 
characterizing  the  elements  of  the  system,  wnich  we 
might  call  the  local  properties  of  the  system.  We 
feel  that  the  structural  properties  of  a large 
i system  (which  we  might  call  the  global  properties) 

may  enable  us  to  make  statements  about  the  system, 
without  requiring  stringent  conditions  on  the  local 
properties.  This  may  greatly  extend  our  ability  to 
apply  known  techniques,  and  develop  new  techniques, 
for  problems  in  th t area  of  Large  Scale  Systems. 
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The  advantage  of  decentralized  techniques  is  that  ey 
make  possible  the  efficient  solution  of  very  large  problems. 
However,  the  appl icaoii ity  of  these  techniques  has  been 
restricted  to  problems  which  satisfy  strict  conditions.  The 
main  contr ibucion  of  our  work  is  to  extend  the  applicability 
of  decentralized  solution  methods  to  problems  where  the 
resource  usage  functions  are  not  well-behaved.  We  give 
conditions  for  the  existence  of  decentralized  solutions,  and 
also  give  algorithms  to  find  such  solutions,  without 
requiring  strict  conditions  on  the  functions. 


The  type  of  assumptions  and  conditions  required  for  our 
results  in  Chapters  3 and  4,  reflect  properties  of  the 
system  as  a whole,  rather  than  the  properties  of  the 
individual  items  in  the  system.  We  feel  that  this  is  an 
important  viewpoint  for  dealing  with  large  systems: 


Results  in  many  areas  of  Large-Scale  Systems  theory 
depend  on  strict  conditions  on  the  functions 


characterizing  the  elements  of  the  system,  wnich  we 
might  call  the  local  proper ties  of  the  system.  We 
feel  that  the  structural  properties  of  a large 
system  (which  we  might  call  the  global  properties) 


may  enable  us  to  make  statements  about  the  system, 
without  requiring  stringent  conditions  on  the  local 
properties.  This  may  greatly  extend  our  ability  to 
apply  known  techniques,  and  develop  new  techniques, 
for  problems  in  the  area  of  Large  Scale  Systems. 
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We  have  already  noted,  in  Chapter  1,  the 
growing  importance  of  Large-Scale  systems  and 
Decentralized  Control.  In  this  context,  two 
fundamental  questions  come  to  mind: 

* 1.  When  is  a system  "large"  ? 

2.  When  is  decentralized  control  "good  enough"  ? 

The  assumptions  and  results  of  our  work  suggest 
that  perhaps  these  questions  should  be  related  by 
definition , that  is,  a system  should  be  considered 
"large"  when  decentralized  control  can  provide 
acceptable  performance. 
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